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GENERAL  FOREWORD 


The  purpose  of  this  Manual  is  to  describe  in  a  series  of  separately-published  Volumes  the 
basic  documentation  practices  which  are  involved  in  the  initial  setting  up,  and  subsequent 
operation  of  an  Information-Library  Organisation  to  provide  defence-aerospace  scientific  and 
technical  information  services. 

•  The  manual  is  primarily  intended  for  the  main  defence-aerospace  information  centres  in 
the  smaller  nations,  and  the  specialised  defence  establishments  and  defence  contractors 
in  the  larger  NATO  countries. 

•  For  those  information  centres  which  already  have  a  well-developed  system,  the  manual 
may  prove  helpful  in  the  work  of  analyzing  and  evaluating  existing  system  performance, 
or  in  revising  an  ineffective  system.  An  important  subsidiary  objective  is  therefore  to 
encourage  the  greater  use  of  modern  techniques  of  information  processing. 

•  The  manual  endeavours  to  meet  the  needs  of  a  wide  spectrum  of  readers  -  the  senior 
man  concerned  with  setting  up  a  new  system,  as  well  as  junior  staff  who  may  be  using 
the  manual  as  a  training  aid. 

•  The  various  Sections  aim  to  focus  on  the  problems  and  techniques  associated  with 
processing  unpublished  reports  and  related  information,  rather  than  conventional  book- 
journal  libraries.  Emphasis  is  on  practical  solutions  and,  where  appropriate,  useful 
operating  suggestions. 

The  manual  has  been  planned  by  the  AGARD  Technical  Information  Panel  and  will 
consist  of  four  Volumes  comprising  twelve  Sections  in  all,  each  prepared  by  a  well-known 
expert  in  the  field.  The  Publication  Layout  is  given  on  the  following  page  and  publication 
of  the  complete  set  will  be  spread  over  three  to  four  years. 


S.C.SCHULER 
General  Editor 
(Former  Chairman,  AGARD 
Technical  Information  Panel) 
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ABSTRACT 


An  introduction  to  the  hardware  and  software  of  computer  systems  is  given,  followed 
by  a  description  of  the  problems  of  the  input  of  data,  including  the  representation  of 
characters  and  the  choice  of  a  character  set.  The  different  types  of  input  and  storage 
equipment  are  described  and  examples  are  given.  Management  and  -ysteins  analysis  problems 
arc  outlined  in  the  project  environment.  Finally,  there  is  a  brief  introduction  to  chemical 
structure  input  Jnd  storage  techniques. 
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I.  INTRODUCTION 

1.1  Contents  of  the  Section 

Hus  Section  of  the  Manual  is  designed  to  give  an  insight  into  the  potential  for  using  the  computer  for  library  and 
information  service  applications.  In  addition,  organizations  which  already  use  computer  systems  may  want  to  improve 
t  lem  and  it  is  hoped  that  the  Section  will  be  of  use  to  them,  too.  Hence,  some  subjects  are  treated  at  a  superficial  level 
and  others  in  greater  depth.  It  begins  with  a  brief  introduction  to  some  of  the  ways  in  which  a  computer  might  be  used 
in  Lbranes  and  information  systems.  It  is  followed  (Chapter  two)  by  a  simplified  explanation  of  the  computer,  designed 
to  give  those  unfamiliar  with  the  concepts  an  insight  into  the  major  hardware  and  software  points.  Chapter  three  deals 
with  the  problem  of  data  input,  including  choice  of  a  character  set,  data  labels,  fom.ats  and  verification  techniques.  This 
is  followed  by  a  review  of  types  of  input  equipment  designed  to  help  in  the  selection  of  equipment  (Chapter  four). 
Chapter  five  contains  a  short  introduction  to  computer  storage  devices.  Microforms  are  not  covered  as  this  will  be  dealt 
with  m  Volume  ill  of  the  Manual.  Chapter  six  deals  with  the  management  aspects  of  the  introduction  and  use  of 
computer-based  information  systems.  Finally,  there  is  a  short  overview  of  chemical  information  input  and  storage. 

An  attempt  has  been  made  to  keep  the  explanations  as  simple  as  possible  and  hence,  in  order  f  get  a  more  detailed 
picture,  the  reader  should  consult  some  of  the  many  works  available,  including  those  cited  in  the  bibliography.  For 
problems  with  terminology ,  the  reader  is  advised  to  turn  to  Gibbs'  (AGARDograph  182).  The  ‘Encyclopaedia  of 
Computer  Science'  by  Ralston  and  Meek2  is  also  a  useful  one-volume  source  of  information.  ‘The  Encyclopaedia  of 
Library  and  Information  Science'1  is  a  comprehensive  multi  volume  work,  useful  for  explaining  individual  points  which 
might  not  be  clear. 

1.2  The  Use  of  Computers  —  Some  Considerations 

During  the  past  decade  or  more,  the  computer  has  played  an  increasingly  important  part  in  the  services  provided  by 
library ,  documentation  and  information  centres,  its  role  is  considered  to  be  essential  in  many  organizations  and  is  likely 
to  increase  in  importance  as  facilities  become  more  sophisticated  and  equipment  becomes  cheaper.  Even  now,  it  is 
important  for  management  to  examine  the  potential  of  using  the  computer  for  solving  the  problems  of  information 
storage  and  retrieval.  It  will  not  be  long  before  organizations  which  do  not  use  computers  will  be  at  a  serious  disadvan 
tage  compared  with  more  go-ahead  competitor  Managers  who  do  not  realize  the  potential  of  computers  as  a  resource 
will  lose  out  to  others  who  take  advantage  of  them.  Properly  managed,  the  computer  will  provide  a  valuable  aid  to  make 
the  information  service  more  efficient  and  effective,  complementing  and  making  the  information  staff  more  effective  by 
giving  them  a  powerful  tool  to  assist  them  in  their  work. 

Properly  managed'  are  important  keywords,  for  the  computer  is  something  which  cannot  be  approached  casually. 
Computers  have,  in  the  past,  cieated  big  problems  for  unptvpared  managers.  Fortunately,  the  techniques  of  systems 
analysis  and  project  management  are  today  on  a  sounder  footing  as  a  result  of  past  mistakes. 

Some  of  the  gains  a  computer  can  bring  are: 

they  are  consistent  and  generally  more  accurate  in  undertaking  routine  tasks  than  human  beings, 

-  they  can  undertake  clerical  tasks  much  more  quickly  than  human  beings; 

the  information  stored  can  be  used  flexibly.  A  single  record  can  be  used  many  times  and  for  different  applica¬ 
tions.  For  instance,  a  record  of  a  recently  acquirtd  book  or  report  can  be  used  to  rapidly  produce  a  list  of  new 
acquisitions  and  for  a  catalogue  of  the  holdings  of  the  library; 

machine-readable  information  can  be  taken  from  or  given  to  othei  organizations,  e.g.,  centrally-  produced 
cataloguing  records  (MARC  Machine-Readable  Catalogue  of  the  US  Library  of  Congress)  or  records  from  an 
abstracting  service  (e.g.,  Chemical  Abstracts  Service). 

There  are  disadvantages  compared  with  manually-held  files: 

-  the  computer  has  to  be  told  exactly  how  to  perform  its  work; 

-  the  machine  sometimes  breaks  down; 

-  the  files  of  information  in  the  computer  are  not  always  completely  up  to  date; 

-  capital  outlay  can  be  expensive; 

-  the  adoption  of  a  computer  can  mean  changes  in  staffing. 

1.3  Computers  for  Library  and  Information  Services 

Libraries  and  information  servn.es  have  files  of  information  which  arc  designed  to  serve  particular  purposes,  for 
instance: 

-  a  list  of  users, 

-  a  catalogue  of  the  stock  of  a  library, 

--  subject  indexes, 

-  a  bibliography  of  articles  in  a  particular  subject  field. 
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Computers  can  be  used  to  store  audi  information  and  in  many  cases,  do  so  with  advantages,  compared  with  manually 
maintained  files,  such  as  card  catalogues.  Some  of  the  ways  that  computers  are  effectively  used  in  libraries  and  informa 
tion  services  are: 

the  production  of  abstracts  journals  and  the  associated  indexes.  The  same  infoimation  can  be  stored  in  the 

computer  and  searches  can  be  made  by  putting  questions  directly  to  the  computer; 

the  production  of  libra.y  catalogues.  The  same  information  can  be  searched  for  directly  in  the  computer  (see 

Appendix  C); 

-  ordering  systems  for  library  materials; 

-  library  circulation  systems; 

the  storage  and  retrieval  of  scientific  data  e.g.  the  structure  and  function  of  chemical  substances. 

The  two  basic  ways  of  using  a  computer  are: 

-  batch  processing, 

-  on-line  mode. 

Using  batch  processing,  many  similar  units  of  information  are  collected  together  and  are  processed  b\  the  computer  at  the 
same  time  (in  batches).  This  technique  has  the  advantage  of  being  less  expensive  than  on-line,  real  time  processing,  but 
the  latest  information  is  not  available  until  a  batch  has  been  processed.  On-line  mode  can  be  defined  in  different  ways, 
but  basically,  a  user  has  a  visual  display  terminal  which  permits  an  interaction  with  the  information  stored  in  the 
computer.  Some  systems  allow  the  u.,er  to  retrieve  information  within  a  short  time,  although  modifications  and  new 
information  have  to  be  entered  in  batches.  Some  other  systems  allow  both  entry  and  retrieval  to  be  made  immediately. 
When  units  of  information  are  treated  individually  by  the  computer  the  technique  is  called  transaction  processing,  i.e., 
the  computer  iesponds  in  real  time  within  a  few  seconds. 

The  facilities  which  are  justifiable  for  a  particular  application  will  depend  on  the  needs.  Tor  instance,  it  is  a  require¬ 
ment  of  an  airline  seat  reservation  system  that  a  cancelled  seat  be  immediately  available  and  hence  immediate  updating  .s 
required.  Increased  demands  lead  to  increased  costs  because  the  computer  system  needs  to  be  more  sophisticated. 

Because  of  the  power  of  larger  computers,  a  number  of  users  can  be  served  simultaneously.  This  can  be  an  efficient  way 
of  using  a  computer  system,  i.e.,  to  share  its  resources  with  other  users.  The  main  ways  of  acquiring  computer  power  are. 

to  purchase  services  from  a  computer  bureau  (this  can  include  personnel  as  well  as  computer  power), 

-  to  use  part  of  a  computer  available  within  ihe  institution; 

-  to  have  a  computer  dedicated  to  the  library  and  information  service. 

The  increasing  power  and  decreasing  costs  of  minicomputers  are  making  the  last  alternative  more  attractive.  The  subject 
has  been  Seated  in  AGARD  publication  LS-92  (Ref. 4).  These  problems  are  dealt  with  in  more  depth  in  Chapter  six. 

External  computer  based  seivices  ale  now  available  which  allow  access,  using  a  terminal,  to  computer-based  informa¬ 
tion  systems.  Thus  it  is  possible  to  search  a  wide  range  of  bibliographic  data  bases  and  factual  data  banks  to  obtain  the 
references  to  documents  of  interest  and  factual  information.  The  computer  may  be  many  kilometers  away  from  the 
terminal,  the  information  being  carried  on  communications  lines,  often  on  the  normal  telephone  network.  The  user  is 
charged  for  the  services  used.  With  such  external  services  it  is  only  necessary  to  acquire  a  terminal  and  learn  how  to  use 
it.  On  line,  real  time  access  and  the  use  of  a  terminal  are  at  opposite  extremes  in  terms  of  the  commitment  of  the 
organuation  to  the  use  of  the  computer.  An  information  service  will  have  to  decide  the  level  of  commitment  it  wishes 
to  have,  by  examining  costs,  performance  and  benefits. 


2.  COMPUTER  HARDWARE  AND  SOFTWARE 

2.1  Components  of  a  Computer 

A  computer  is  a  device  in  which  information  can  be  stored  and  organised,  and  from  which  it  can  be  retrieved  and 
displayed.  Simply  speaking,  a  computer  consists  of  the  following: 

hardware,  the  physical  equipment  which  stores,  manipulates,  transmits  and  transforms  information, 

-  software:  the  sets  of  instructions  which  tell  the  computer  what  to  do. 

In  practice,  hardware  and  software  are  two  inter  related  subsystems,  both  of  which  are  necessary  for  the  correct 
functioning  of  the  machine. 

2.2  Computer  Hardware 

Tile  computer  contains  a  number  of  devices  which  transduce  (i.e.  transfo'm  data, information  from  one  physical 
form  to  another),  store,  transform,  communicate  arid  control.  An  outline  of  the  components  of  a  computer  are  shown 


■lrN  i  n  ■  .  i-ren'  VrtaC.  f  ,i  m'  i'MyflTlirfiit-YlliaVr,  ■  (Wrt'iM  VfcC- jin  ciiirri'frv  hnr  - ....  — ’Vc 
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in  Figure  1 .  The  devices  wliich  make  up  a  computer  arc  described  in  detail  by  Mao5  and  they  have  the  following 
functions: 

-  storage, 

-  input, 

-  output, 

-  control, 

-  arithmetic  and  logic  operations, 

-  communications. 


central  processing  unit 


Fig  1  The  major  hardware  components  of  a  digital  computer 


Storage  devices  hold  the  information  that  has  been  put  into  a  computer.  Information  is  stored  and  processed  as 
binary  numbers  using  'bi-stable’  devices.  An  electric  light  switch  can  be  regarded  as  such  a  device,  for  it  can  either  be  off 
or  on.  If  the  number  T’  is  equivalent  to  ‘on’  and  ‘0’  to  off,  then  a  number  could  be  stored  in  a  series  of  such  devices 
(bits).  Letters  of  the  alphabet,  numbers  and  other  symbols  are  all  reduced  to  representative  numbers  in  the  computer. 
This  is  explained  in  more  detail  in  chapter  three.  The  physical  characteris*ics  of  storage  devices  usually  determine  the 
speed  with  which  data  can  be  retrieved  (access  time),  the  amount  of  data  which  can  be  stored  and  the  amount  of  data 
which  can  be  transferred  from  the  device.  Main  memory  is  used  to  store  programs  and  the  information  which  is  currently 
being  processed  and  hence  the  speed  of  access  is  extremely  fast. 

A  number  of  techniques  are  used  to  produce  main  memories,  the  most  widely  known  of  which  is  the  ferrite  core. 

A  store  consists  of  a  series  of  corev  each  of  which  can  be  magnetized  in  one  of  two  directions  to  store  either  a  ‘1  ’  or  a 
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‘O’.  Semiconductor  memories  are  increasingly  being  used;  these  arc  binary  digital  memories  which  use  an  electronic 
circuit  for  each  memory  cell.  Discs,  drums  and  magnetic  tapes  are  examples  of  storage  devices  which  hold  more 
information  than  main  memory  but  which  have  a  slower  access  time.  They  are  often  used  to  store  files  of  data.  Storage 
devices  arc  discussed  in  more  detail  in  chapter  five. 

Input  devices  are  used  to  put  the  information  into  the  computer.  Punched  card  machines  and  visual  display  units 
are  examples  of  such  transducers  by  which  information  is  transferred  from  paper  into  machine  readable  form.  Each 
individual  letter,  number,  symbol  etc.  is  translated  into  a  corresponding  pattern  of  ones  and  zeros.  Input  devices  are 
discussed  in  chapter  four  and  the  problems  of  putting  data  into  the  machine  is  described  in  general  in  chapter  three. 

Output  devices  are  used  tc  display  information  and  a  variety  of  machines  are  available.  Printers  with  speeds  ranging 
from  tens  of  characters  per  second  to  20,000  lines  per  minute  are  in  use.  Video  display  units  can  be  used  for  displaying 
information  in  alphanumeric  or  in  graphical  form.  Microfilm  can  be  produced  directly  by  COM  (computer-output- 
microfilm)  and  graphs  can  be  plotted  using  graph  plotting  equipment.  Computer  typesetting  devices  are  used  to  produce 
printed  texts  directly  from  the  computer. 

The  control  unit  issues  commands  and  control  signals  to  other  functional  units.  For  instance,  it  fetches  instructions 
from  main  memory,  decodes  them,  selects  the  part  of  the  arithmetic  and  logic  unit  to  carry  out  a  calculation  and 
transmits  the  data  to  it. 

The  arithmetic  and  logic  unit  is  that  part  of  the  machine  in  which  all  arithmetic  and  logical  operations  are  carried 
out.  The  arithmetic  and  logic  unit,  together  with  the  control  unit  and  main  memory  are  often  referred  to  as  the  central 
processing  unit  (CPU). 

Communications  between  functional  units  is  supervised  by  the  control  unit  (sec  Figure  1).  Peripherals  (c.g„ 
magnetic  discs  or  visual  display  units)  may  demand  attention  by  interrupting  the  CPU.  Computer  peripherals  may  be  at 
some  distance  from  the  CPU  and  communicate  via  telecommunications  lines  or  specially  constructed  networks.  Some¬ 
times  a  small  computer  (called  a  front-end  processor)  is  attached  to  the  main  computer  to  handle  video  display  terminals 
and  other  computer  peripherals  which  arc  at  a  distance  from  the  main  computer. 

Details  of  most  of  the  hardware  produced  by  the  major  manufacturers  are  available  in  the  Auerbach  computer 
technology  reports.6 

2.3  Computer  Software 

Computer  software  is  the  means  used  to  tell  the  computer  what  to  do.  It  consists  of  programs  which  comprise 
logical  series  of  instructions  written  in  a  language  convenient  to  human  beings.  Fortran,  Cobol  and  PL1  arc  well  known 
languages.  A  program  is  translated  into  the  language  of  the  machine  using  another  program  called  a  compiler.  The 
translated  program  consists  of  a  series  of  very  specific  instructions  (e.g.  add,  divide,  etc.)  which  are  stored  in  main 
memory  and  arc  executed  sequentially  by  the  arithmetic  and  logic  unit.  For  a  more  detailed  description  sec  Raison2. 

A  major  part  of  the  computer  system  is  the  operating  system,  a  series  of  program.,  which  also  reside  in  main  memory. 
These  programs  are  designed  to  undertake  the  various  supervisor)'  and  management  tasks  designed  to  maximize  the 
efficiency  of  the  computer.  For  instance  the  operating  system  decides  where  in  main  memory  a  program  should  reside. 
Operating  systems  vary  greatly  in  their  capabilities  and  are  much  more  sophisticated  for  largo  computers  than  for  small. 
For  instance,  some  systems  provide  for  on-line  real-time  working  and  others  only  for  batch  processing.  Coffman7  covers 
the  subject  of  operating  systems  in  detail. 

Computers  usually  have  a  range  of  software  which  is  designed  to  ease  the  task  of  programming.  Examples  of  tasks 
undertaken  by  such  supporting  programs  are: 

sorting  of  information  into  a  specified  order  (sort  programs), 

data  base  management, 

compilers. 

Depending  on  the  task  in  hand,  some  of  the  above  programs  are  used  by,  or  included  alongside,  the  application  program, 
because  of  the  increasing  cost,  complexity  and  importance  of  software  it  is  often  divided  by  the  manufacturer  into 
smaller  parts  which  arc  then  sold  separately  (unbundling). 

Supporting  software  and  packages,  which  can  be  used  on  their  own  to  perform  specific  tasks,  are  available  from 
sources  other  than  the  manufacturer  of  the  computer  system.  Organizations  exist  which  produce  software  products  and 
sell  them.  Goetz*  reviews  the  future  prospects  of  the  software  products  industry.  Packages  can  be  the  best  approach  to 
software  production  because: 

software  products  are  usually  sold  with  a  guarantee  and  are  maintained; 
detailed  documentation  is  usually  available; 

new  versions  become  better  but  are  made  compatible  with  older  versions; 
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-  if  a  package  is  distributed  widely,  the  purchase  price  can  bo  reasonable; 

-  a  general  package  is  usually  reliable  because  it  has  many  users. 

However,  software  packages  may  not  be  able  to  cater  for  specific  local  needs. 

Data  base  management  systems  arc  important  pieces  of  supporting  software  which  help  in  the  use  of  the  computer. 
Using  DBMS,  information  can  be  controlled,  manipulated  and  retrieved  in  an  optimal  fashion.  DBMS  are  worthy  of 
mention  because  the  choice  of  the  correct  one  can  be  a  critical  factor  in  determining  how  much  effort  will  be  needed  to 
set  up  and  operate  computer  systems.  Because  they  arc  supporting  softwaie,  a  certain  amount  of  programming  will  need 
to  be  done  and  that  amount  will  depend  to  a  large  extent  on  the  facilities  provided  by  the  DBMS.  Some  DBMS  arc  more 
suited  to  bibliographic  applications  than  others.  In  fact  there  arc  DBMS  which  have  been  devised  especially  for  such 
applications  For  instance,  the  GRIPS  (General  Relation  based  Information  Processing  System,  produced  by  the  Deutsches 
Institut  filr  Medizinischc  Dokumentation  und  Information)  has  facilities  which  arc  specially  designed  for  managing 
bibliographic  data  bases  An  outline  of  the  features  is  shown  in  Figure  2.  GRIPS  permits  the  user  to  input  information 
on-line,  to  store  it  in  the  form  of  relations  (see  Reference  9)  and  retrieve  the  data  with  an  associated  information  retrieval 
system  A  report  writer,  which  facilitates  the  compilation  of  special  listings  in  a  desired  format  and  of  statistical  informa¬ 
tion  is  also  available. 

Somewhat  similar  is  the  software  produced  by  the  International  Development  Research  Centre  in  Canada  (IDRC)10. 
The  software  is  more  orientated  towards  library  applications  than  GRIPS,  but  it  is  also  built  on  a  rclatio  ial  DBMS.  The 
system  is  novel  in  that  it  is  implemented  on  a  minicomputer.  A  number  of  DBMS  arc  available  for  mini  ompuicrs,  but 
their  capabilities  need  to  be  examined  even  more  carefully  than  those  for  larger  machines  for  they  may  after  fewer 
facilities  Indeed  IDRC  decided  to  build  their  own.  A  survey  of  commercial  DBMS  for  minicomputers  lias  been  made 
by  Groscli".  Auerbach6  surveys  DBMS  in  general.  A  number  of  books,  for  example, is,i3,u  (rcat  t|lc  problems  of  data 
base  management  systems. 


DINUPS:  INPUT  AND  UPDATE  SYSTEM 

DDBG:  DATA  BASE  GENERATOR 

D1VS:  INFORMATION  VAL1DING  SYSTEM 

DIRS  2:  INFORMATION  RETRIEVAL  SYSTEM  2 

DLPG:  LIST  PROGRAM  GENERATOR 

DIAS:  ADMINISTRATION  SYSTEM 


Fig,2  The  components  of  the  DIMD1  relation-based  data  base  management  system,  GRIPS 
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3.  INPUT  OF  DATA  TO  A  COMPUTER 

3.1  The  Problem 

In  order  to  store  information  in  a  computer,  the  information  has  first  of  all  to  be  put  into  the  machine.  This  chapter 
deals  with  input  and  the  associated  problems  such  as  making  sure  it  is  correctly  recorded,  and  considers  how  the  lntonna 
tion  must  be  represented  in  order  to  be  of  most  use. 

3.2  Internal  Computer  Character  Representation 

There  are  several  ways  of  representing  information  in  the  computer  and  the  best  way  depends  on  the  nature  of  the 
information  to  be  presented  to  the  machine.  For  the  most  part,  information  for  documentation  is  in  the  form  of  symbols 
i.e.  letters,  numbers  or  special  characters.  Computers  are  designed  to  manipulate  such  information,  they  have  a  character 
set’  or  standard  set  of  symbols,  each  of  which  has  a  corresponding  pattern  of  bits.  Lynch1'  deals  with  the  problem  of 
character  manipulation  in  a  computer.  An  example  of  a  set  of  characters  in  wide  use  is  that  adopted  by  the  International 
Organization  of  Standardization  (ISO)  (Ref.  16).  The  set  of  characters  is  shown  in  Figure  3.  It  has  been  adopted  by  many 
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Fig.3  International  reference  version  of  the  7-bit  coded  character  set  for 
information  processing  interchange  (ISO  64 0) 


countries  for  use  as  a  national  standard,  the  best  known  being  the  ASCII  (American  Standard  Code  for  Information 
Interchange)  character  set.  The  information  is  contained  in  7  bits  which  gives  a  total  of  128  (21)  different  characters. 
For  instance: 


Code  (bits) 


Character 

Binary 

Decimal 

A 

1000001 

65 

B 

1000010 

66 

a 

li 00001 

97 

b 

1100010 

98 

1 

0111111 

63 

2 

01 10010 

50 

The  character  set  used  by  IBM  is  called  EBCDIC  (Extended  Binary  Coded  Decimal  Interchange  Code).  This  is  an  eight 
bit  code,  giving  28  or  256  possibilities.  An  8  bit  version  of  ASCII  is  also  available. 

A  knowledge  of  the  codes  used  to  represent  characters  is  important  especially  from  the  point  of  view  of  the  order  in 
which  the  characters  appear  (code  sequence).  For  instance,  in  ASCII,  the  numbers  have  lower  codes  than  letters  (A~65, 
2=50):  the  opposite  is  the  case  of  EBCDIC.  Since  sorting  of  characters  is  performed  on  the  value  of  the  codes,  the 
desired  order  of  alphabetization  may  have  to  be  imposed  using  special  techniques. 

3.3  Choosing  a  Character  Set 

The  size  of  the  character  set  required  will  be  dependent  on  the  application.  For  many  applications,  the  normal  ISO 
character  set  is  sufficient.  For  other  applications,  e.g.  for  the  representation  of  mathematical  equations,  a  larger  set  will 
be  required.  The  size  of  the  internal  character  set  is  often  not  the  limiting  factor  from  the  point  of  view  of  the  computer. 
Input  and  output  devices  often  restrict  the  character  set  further.  Some  devices,  such  as  the  card  punch  and  line  printer, 
often  have  upper  case  only. 

There  are  many  applications  for  which  it  may  be  necessary  to  have  a  character  set  larger  than  the  internal  set.  In 
computer  typesetting  operations,  for  instance,  not  only  must  a  larger  set  be  stored  (e.g.,  Greek  letters)  but  information 
on  the  type  font  (e.g.,  itallic,  bold)  must  be  given.  In  such  cases,  special  conventions  are  used.  For  instance,  for  the 
publications  of  Inspcc  (The  Institution  of  Electrical  Engineers  in  the  United  Kingdom) 

$u2  is  used  for  superscript  2 
SgS  is  used  for  Greek  letter  2 

Special  programs  have  to  be  written  to  manipulate  these  characters. 

Many  information  centre  and  library  applications  can  be  covered  by  the  normal  character  set.  The  set  used  at  the 
United  Kingdom  Defence  Research  Information  Centre  (DRIC)  (Ref.  1 7)  is  shown  in  Figure  4.  Special  character  problems 
are  mainly  dealt  with  by  spelling  out  those  which  cannot  be  represented,  as  shown  in  Figure  5. 

For  example,  H2S04  becomes  H2S04 
60°C  becomes  60  deg  C 

12345678900  © 

+  ©/$£=©  ©  (  )  cr  © 

qwerty  uiop' 

QWERTYUIOP" 
asdfghjkl  —  ; 

ASDFGHJKL_: 

zxcvbnm,.© 

ZXCVBNM®.® 

Note:  Characters  ringed  should  not  be  used  in  abstracts 

Fig.4  The  Datamatic  tape-typewriter  keyboard  character  set  as  used  at  the 
United  Kingdom  Defence  Research  Information  Centre 
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1.  Angstrom  unit  A:  replace  by  “A". 

2.  Chemical  formulae:  write  on  one  line,  eg  112  SO.,  becomes  112S04. 

3.  Common  logarithms  (base  10):  use  “log*'. 

4.  Natural  logarithms  (base  c):  use  “In", 

5.  Square  roots  \/  use  “square  root  of. 

6.  use  either  “and”  or  as  appropriate. 

7.  >:  use  “>”  or 
S.  <:  use  “<”  or 

a-  b 

9.  Fractions:  use  an  oblique  stroke  and  brackets  if  necessary,  eg -  becomes  (a  b)/c. 

c 

10.  use  “yields”  if  chemistry,  “approaches”  if  mathematics. 

11.  #:  use  “no”. 

12.  Degrees0:  use  “deg”. 

13.  Subscripts:  use  "sub”,  eg  Cl  becomes  CsubL. 

14.  Exponents:  use  “to  the  power”,  eg  x4  becomes  x  to  the  power  4.  However,  numbers 
involving  *  powers  of  10  may  usefully  be  simplified,  eg  1.19  x  1  O'*  becomes  1 1,900. 

1 5.  The  **  question  mark:  omit  in  title,  and  rephrase  abstract  to  make  it  unnecessary. 

16.  Greek  letters:  use  names,  in  general.  However,  it  may  be  more  useful  to  name  the 
quantity  concerned,  eg  wavelength  (X),  frequency  (i»).  angular  velocity  (w)  etc. 

17.  use  “infinity”. 

18.  /:  use  “the  integral  of*. 

19.  ■>,  use  “per  cent”. 

20.  i:  use  1/2. 

Notes:  *  positive  or  negative 

**  used  as  a  field  terminator 

Fig,5  Representation  of  special  symbols  in  the  United  Kingdom  Defence 
Research  Information  Centre  data  base 


3.4  Labels  and  Formats 

A  computer  program  which  processes  data  needs  to  bo  able  to  recognise  the  data  with  which  it  is  working.  Individual 
pieces  of  information  (sometimes  called  data  elements)  are  usually  labelled  for  this  reason.  Information  is  often  entered 
onto  a  specially-prepared  form  and  a  place  is  reserved  for  each  data  element.  An  example  of  a  form  used  for  input  is 
shown  in  Figure  6  (The  System  for  Documentation  in  Metallurgy).  Each  data  clement  is  given  a  three-digit  number 
(called  a  ‘tag’)  e.g.,  CODEN  has  a  tag  of  102  in  the  example  in  Figure  6.  Note  that  individual  authors  are  separated  by 
a  'S’  and  the  author  affiliation  appears  under  the  same  tag.  Further  examples  are  shown  in  Appendix  B. 

It  is  important  to  decide  what  a  data  clement  comprises.  For  instance,  should  all  the  authors  of  a  document  be  given 
a  single  tag  (as  in  the  example  in  Figure  6)  or  should  each  author  be  given  one’  Increasing  the  depth  of  labelling  increases 
the  work  required  to  input  ami  process  the  information.  On  the  other  hand,  if  there  are  fewer  elements,  i.e.  if  elements 
which  are  put  together  only  have  one  tag,  the  computer  might  not  be  able  to  do  what  the  user  wants  because  it  cannot 
separate  the  elements. 
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|  EN.-Antimony(TII)  and  antimony (V)  in  solution  can  be  separated  by  I 

adjusting  the  pH  to  9.5  and  stoking  with  sodium  diotliyldithloosrbama to  (NallDC)  and 
polyurettone  foam.  <U tentatively,  the  solution  can  be  passed  through  a  column  of  foam 
that  has  previously  been  treated  with  5%  m/V  of  NaTOC  in  carbon  tetrachloride .  The 
Sb(DDC)^  complex  retainer!  by  the  foam  can  bo  eluted  with  acetone.  DBC  complexes  of 
iron(UI),  mercury  and  silver  are  also  absorbed,  but  are  ttnlikely  to  interfere  In  the 
subsequent  determination  jf  antimony-  by  either  atomic-absorption  .spectrophotometry  or 
neutron-activation  analysis . 

’  Rolotencos 

12  Figures  Tables  3  i 

Fig.6  An  example  or  input  to  the  System  for  Documentation  in  Metallurgy  V(SD1M) 
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The  depth  of  labelling  or  tagsing  is  also  dependent  on  the  format  of  each  data  element.  Implicit  labelling  can  some¬ 
times  be  introduced  into  data  elements  by  the  use  of  standard  syntax  which  reduces  the  amount  of  explicit  tagging 
required.  For  a  group  of  authors; 

Smith,  A.U;  Jones,  l’J. 

the  surname  is  separated  from  the  initials  by  a  comma,  initials  are  separated  by  a  full  stop  and  authors  arc  separated  by  a 
semicolon  The  use  of  such  standard  syntax  allows  a  program  to  pick  out  tl-,t  surname  of  each  author.  Standards  amt 
guidelines  for  the  representation  of  bibliographic  data  are  available,  for  instance  in  the  UN1SIST  Reference  Manual18  and 
the  ISBD19  (International  Standard  for  Bibliographic  Description),  it  is  useful  to  produce  a  ’data  dictionary’ m  which  the 
content  and  format  of  each  element  are  described,  together  with  examples.  This  document  will  serve  as  a  standard 
reference  for  all  persons  concerned.  An  example  is  shown  in  Appendix  B. 

3.5  Accuracy 

Once  information  has  been  input  to  the  computer,  it  must  be  validated  am  checked  for  accuracy  (data  validation). 
The  techniques  used  for  correction  of  errors  will  depend  on  the  equipment  available,  for  instance  a  visual  display  unit 
attached  to  a  computer  can  be  used  to  perform  an  immediate  check.  Corrections  can  also  be  made  at  that  time  unless 
extensive  proof  reading  is  required.  Hardware  aspects  are  dealt  with  in  more  detah  in  chapter  four. 

A  number  of  techniques  exist  for  using  the  computer  for  data  validation.  For  instance,  the  addition  of  a  check  digit 
to  a  number  allows  the  computer  to  check  that  a  number  has  been  keyboarded  correctly.  How  this  is  done  is  explained 
in  Figure  7  which  shows  how  to  calculate  a  check  digit  tar  an  International  Standard  dock  Number  (ISBN).  The 


The  International  Standard  Book  Number  i ISBN)  is  a  number  that  identifies  one  title, 
or  edition  of  that  title  if  there  is  more  than  one,  or  volume  of  a  multi-volume  work,  from 
one  specific  publisher,  and  is  unique  to  that  title  or  edition  or  volume.  The  construction 
of  an  ISBN  is  explained  in  British  Standard  1VS  4762:  1471.  The  check  digit  of  an 
ISBN  number  is  used  to  guard  against  errors  i,»  transcription,  it  is  calculated  in  the 
following  manner: 

A.  Bach  digit  in  the  group,  publisher  r-rni  title  number  is  multiplied  by  a  “weight”. 

The  first  digit  is  multiplied  by  10.  the  second  by  9,  the  third  by  S,  and  so  on. 

For  example,  0  S  3  5  2  000  1  ISBN 

10  9  S  7  65  43  2  weights 

~0  72  24  35  imCUU 

B.  The  products  of  the  multiplication  are  added  up,  thus: 

0  +  72  +  24  4-  35  +  12  +  04-0  +  0  +  2  =  145 

C.  The  result  is  divided  by  1 1  to  find  the  remainder: 

145  +  11  =  13  with  a  remainder  of  2 

1),  The  remainder  is  subtracted  from  1 1  to  find  the  required  check  digit*: 

11-2  =  9 

♦If  the  result  of  subtracting  the  remainder  from  1 1  is  10  the  check  digit  is  written 
as  X,  thus  maintaining  the  ISBN  at  a  standard  length  of  ten  digits. 

B.  The  check  digit  is  appended  to  other  digits  to  complete  the  ISBN, 
eg.,  ISBN  0-S352-0001-9. 


Fig.7  Calculation  of  ISBN  check  digit 


technique  is  a  valuable  one  because  it  is  usually  important  to  have  an  accurate  and  unique  identification  code  tor  each 
item  Some  data  validation  techniques  are  given  in  Table  1 .  The  possible  checks  w  ill  depend  a  large  extent  on  the  oat  a 
being  treated  Hines20  suggests  that  an  empirical  approach  to  development  of  error  checking  .onlines  should  be  adopted 
with  emphasis  on  the  convenience  of  the  person  doing  the  input  rather  than  the  programmer.  Hart51  describes  the  steps 
taken  to  input  data  for  the  production  of  an  abstract  journal.  Details  of  the  input,  validation  and  error  correction 
routines  are  given  and  it  is  a  usefiil  description  for  the  organizations  with  little  or  no  practical  experience.  Hie  United 
States  Defense  Documentation  Center51  has  produced  a  data  input  manual  which  describes  the  error  routines  used  in 
conjunction  with  their  on-line  input  system. 


TABLE  1 


Some  Data  Verification  Techniques 


Verification  technique 


Description 


Proof  reading  -  Manual  check  of  tiie  input  against  the 

original 

Key  data  twice  -  Machine  compares  two  versions 

(used  mainly  on  card  punch  devices) 


Consistency  and  plausibility 
by  program: 

1 .  Check-digits 

2.  Plausibility  of  data 

3.  Presence  of  certain  data 
elements 


see  ISBN  example  in  Figure  7 

e.g.,  dates  in  a  particular  range,  page 
numbers  that  follow 

certain  elements  may  have  to  appear 
or  certain  combinations  e.g.,  a 
document  may  always  have  to  have 
a  title 


4.  Syntax  checks 

5.  Sequencing 

6.  ‘Hash’  totals 

7.  Type  of  data 

8.  Field  length 


-  Authors  name  followed  by  a  comma, 
spaces  after  punctuation 

-  A  sequential  number  given  to  lines 
of  data  allows  the  sequence  of  lines 
to  be  checked 

-  the  computer  counts  the  number  of  lines 
or  records  input  and  compares  this 
against  a  given  total 

-  e.g.,  a  data  element  may  be  all  numbers 
or  just  letters 

-  for  Fixed  length  data  elements,  that 
the  length  is  given;  for  variable  length 
elements,  that  length  is  not  greater  than 
maximum 


9.  Use  of  authority  files  -  the  content  of  data  elements  e.g., 

thesaurus  terms  can  be  checked  against 
a  master  list  of  terms 


4.  INPUT  EQUIPMENT 

4.1  Introduction 

The  GIGO  (Garbage  in  G-rbagv  out)  principle  serves  only  to  underline  the  importance  of  having  a  system  which 
ensures  that  information  is  accurately  transferred  into  machine-readable  form.  The  traditional  methods  v.punched  card 
and  papei  tape)  are  being  superseded  by  more  modem  techniques  including  key  to  magnetic  storage,  on  line  input  and 
optical  character  recognition.  This  chapter  describes  the  various  equipment  in  use  for  information  systems  and  compares 
them.  Graphic  displays  are  not  included.  The  methods  of  input  included  are: 

-  punched  card, 

-  punched  paper  tape, 

-  key-to-magnctic  storage  devices, 

-  on-line  input, 

-  optical  character  recognition. 

4.2  The  Punched  Card  (IBM  card) 

The  punched  card  was  perhaps  the  earliest  medium  for  recording  informuti  m  in  machine  readable  form.  The 
technique  was,  in  fact,  used  purely  as  a  storage  medium  for  many  years  before  uei.ig  used  in  conjunction  with  computers. 

The  standard  punched  card  is  divided  into  80  columns  and  1 2  rows,  each  column  being  equivalent  to  one  character 
of  mformation.  Thus,  one  card  can  hold  eighty  characters  of  data  and  an  example  is  shown  in  Tigure  8.  Holes  are 
punched  in  the  card  according  to  the  character  to  be  recorded  and  the  combination  of  holes  for  a  particular  charav  ter  is 


I  : 


lllllllli 


lllllllll 


I 

I  I 


I  ISI 


I  I 


II 


I 


OGOQCOOOOOOQOOOOOQgi|gifg££90dOOOOOOG3|00|OgOOO|OgOOOi)|0000|0000|OOQQOOQOOOOOOOOQ 
i  1  )  <  s  i  i  i  i  h  ii  u » 11  is  it  mt  n  it  »  n i  o  imou  omiiuiijj  vvv'H!Jvi>si)imi*.fu<Bmt«ti.jn;/.i  t  .slum  nn 

|m  n m|m i m  1 1  u5 n m|i  n m  m  m i n  u  i|m  m  i  m  1 1 n  i  m  n i  n  i  n  i  m  n  i 

2|2222?222|222222!R22''2?3222|}222222222222222222222|222|222222|222222222?  2  2222222 

I  1  >  I  >  I  I  I  i  t  II  >J!1  -IIS  Hit  MU  »!.’•  •)  11  *.  .'ll.’ M II  » 11  H  U II  IS  It  11 H  IS  U  II 1111 II  IS  IS  H II  ft  » !l  V 1}  II «  SS  S' M  SI  H II  t!  U  H  IS  « till  IS  n  II  It  >1  It  IS  II II II II  SS 

33|333'13J33g333J33?,B333S33333I333333||U333333  3  33333  3333  3  333333||3  3  3  3333  3333  33  3  3 

4 44 £4  44  <4  4  4  4S4MM4  4|4  4  4  4UM4|4  4  4  4  4  4  4  4  4  4  ||H4  4  444444444444444444444444444444444 

i  1 1  i  i «  i  i  t  >s  ii  oi<i»  a  it  run  nr  mjii. -sis  rust!  'i.?»ni>*  nmm  minis  ;<ii  ii  ir;n  si  si  ssss  s,  si  si  si  ism.  ums> shush  it  >  in  is  tnn.tit  u 

55S5|b‘  S5F.5&5|5S55555|555!55555f|55555555555555S5||5S55555|555G|5!)55S555555555b55 
GG6861S  6566866 JS 6866 8 6|86Dfifi86Gfi|868BS566665666|6 6666 |6g6866|R86BEe66666&68 a  8  666 

o  ■  t  1  *  s  I  I  I  t  II II  *»  IJ  >13  IS  till  113 II H  nil  a. <11  IS  MB  SI  <1  1M  »  JS  J'4  SS  Hint  Ull  0  4.  I>  U  is  SJ  !ISJ  MSI  SS  SS5  VS,HSlt,tl«IS».rS'im  I  ITS  Illy  )"  3iO 

■  ?Hnnmnmiinmii?mn?n|mnnnnnnn|im|?||m?mm??mmn 

'  d  8988888|88888888|88888S8|888888888|Sg|||8||||88|||||8gSg|||||||||6t;8888t>86688S88 

I  I  I  1  I  i  *  I  I  1  M  *1  >2  >S  II  li  3 II II  lilt  1* I'll  UK  II  fl  sin  «  •  *1  >1  :s  Ml  M  a  l' I  mill  IS  U  I  H  l>  Mil  11 1!»1*1M"-!«'1<3S- Until* '**'>>».  ■  '  ,  !*li>  >1  Il.i; 

g9339a999S33S999S9|938999S|999998389|9999S999999S83393999999S39883988989»P89894J9 

i  i  i  i  i  i  i  ■  iuiiii)iHis  <iMtu:!».t,.Mi)j«t!»».viij;u»i».»|iaaiiii<,iini!i!i'uus}sis.,,i<i»>'j>‘>ism*i*»,a*s>'’:iisi';.i.<  j  iisuiui  i  i 

Fig.S  Art  example  of  a  pi  nched  card 


unique.  'Die  characters  recorded  are  printed  along  the  top  of  the  card.  A  variation  on  the  80-column  card  is  the  IBM  96 
column  card  which  is  much  smaller  in  size  and  can  record  more  data.  However,  the  vast  majority  of  cards  in  use  retain 
the  standard  80-column  format.  The  machine  for  punching  a  card  has  a  keyboard  rather  like  that  of  a  normal  typewriter 
except  that  there  are  differences  in  the  layout  of  the  keys.  The  blank  cards  are  stored  in  a  hopper  and  are  fed  through 
the  machine  as  needed. 

Different  machines  have  various  degrees  of  sophistication  and  the  range  of  facilities  includes: 

Verification:  an  operator  re-types  on  the  keyboard  the  content  of  the  cards  in  order 

to  check  whether  the  recorded  data  are  correct.  Usually  a  small  notch 
is  punched  in  the  card  if  the  data  are  correct. 

Sorting:  the  machine  will  sort  the  cards  into  a  predefined  order  based  on  the 

content  of  one  or  more  columns 

Interpreting:  the  characters  corresponding  to  the  sequence  of  holes  are  printed  at  the 

top  of  the  card.  (This  is  useful,  for  instance,  if  a  deck  of  cards  has  'seen 
punched  by  a  computer.) 

The  character  set  on  punched  card  machines  is  usually  limited  in  practice,  for  although  it  is  possible  to  punch  special 
characters,  this  usually  has  to  be  done  by  punching  the  combination  of  holes  for  each  character  individually,  which  takes 
longer.  The  advantages  and  disadvantages  of  this  input  medium  are  shown  in  Table  2. 

TABLE  2 


Advantages  and  Disadvantages  of  the  Punched  Card 
as  an  Input  Medium 


Advantages 

Disadvantages 

Visible  check  of  data  possible 

Individual  cards  can  be  sorted  by 
machine 

Can  be  used  as  a  manual  reference 

Inexpensive  medium 

Large  number  of  card  punching 
machines  in  use 

Density  of  storage  low 

Can  easily  get  out  of  order 

Relatively  slow  on  input  to 
computer  (compared  with 
magnetic  tape) 

(Usually)  limited  character  set 

Necessary  to  repunch  a  whole  card 
to  change  data 

4.3  Punched  Paper  Tape 

Like  punched  cards,  paper  tape  was  available  before  the  advent  of  the  computer  and  since  then  has  seen  widespread 
use,  although  it  has  been  more  popular  in  Europe  than  in  the  USA.  Early  paper  tape  punches  had  five  tracks  giving  a  total 
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of  32  (24)  characters  only;  later  the  number  of  tracks  was  increased  and  the  layout  of  an  eight-track  tape  is  shown  in 
Figure  9.  A  row  of  eight  positions  across  the  tape  corresponds  to  a  single  character  with  seven  positions  representing  the 
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Fig.9  Punched  paper  tape 


character  and  the  eighth,  a  parity  check.  A  single  character  is  represented  by  a  predetermined  set  of  holes  and  blanks. 
The  number  of  holes  punched  is  always  either  odd  or  even,  depending  on  the  convention  used,  and  the  eighth  hole  is 
either  punched  or  not,  to  produce  a  single  parity.  The  technique  is  a  checking  mechanism.  The  small  holes  are  called 
sprocket  holes  and  arc  used  to  feed  the  tape  through  a  reader,  cither  by  engaging  a  sprocket  wheel  or  providing  clock 
pulses  when  being  read  by  a  higher  speed  photoelectric  reader.  The  relative  advantages  of  the  medium  are  listed  in 
Table  3. 


TABLE  3 


Advantages  and  Disadvantages  of  Paper  Tape 
as  an  Input  Medium 


Advantages 

Disadvantages 

Easy  tc  handle 

Takes  up  little  space 

Transported  easily 

Record  always  available 

Variable  length  records 

Inexpensive  medium 

Difficult  to  read  directly 

Records  cannot  be  physically  sorted 

Need  to  repunch  a  whole  record  to 
correct  error 

Relatively  slow  reading  speeds 

Material  tends  to  be  fragile 

4.4  Key-to-Magnetic  Storage  Devices 
4.4J  Introduction 

Devices  which  record  data  directly  onto  magnetic  storage  devices  are  a  comparatively  reccn*  commercial  innovation. 
There  is  now  a  wide  variety  of  machines  available.  Magnetic  tapes,  (loppy  discs  and  discs  are  all  used  as  storage  media. 
(Magnetic  storage  media  are  described  in  chapter  five).  The  machines  often  have  a  processing  capability,  allowing  data 
to  be  validated  before  being  stored.  In  effect  they  are  a  small  computer.  Some  systems  use  a  hardcopy  device  (like  a 
typewriter)  for  displaying  input,  others  use  visual  display  units.  It  is  possible  for  a  number  of  terminals  to  share  a  single 
processor. 

An  outline  of  a  typical  system  is  shown  in  Figure  10  which  shows  the  inside  of  a  DATAP01NT*  key-to-cassette  tape 
processor.  The  system  has  a  typewriter  keyboard  plus  a  separate  numeric  pad  of  1 1  keys.  A  video  display  of  80  columns 
by  12  rows  (or  alternatively  80  by  24),  giving  960  (or  1920)  character  positions,  is  provided.  Information  is  stored  on 
cassette  tapes  and  in  some  cases  there  arc  two  cassette  tape  recorders  in  a  single  machine. 


DATAP01NT  i:  a  copyright  held  by  the  Datapoint  Corporation. 
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Snside  the  DATAPOINT  processors 


Memory 


Fig.  1 0  Schematic  diagram  of  a  DATAPOINT®  key-to-cnssette  tape  processor 
(from  the  Datapoint  Equipment  Catalogue  197S  Copyright  ©  Datapoint  Corporation) 


4.4.2  Hardware 

There  is  a  wide  variety  of  hardware  to  choose  from,  ranging  from  a  single  station  (see  Figure  1 1)  with  one  keyboard, 
typewriter  output  and  simple  cassette,  store,  designed  to  replace  punched  card  and  tape,  to  powerful  multistation  devices 
with  several  keyboards  attached  to  the  same  processor.  The  following  shows  the  range  of  possibilities: 

Processors/Main  Memory 

From  a  small  buffer  (for  storing  the  characters  displayed  on  a  screen)  upwards  the  larger  the  processor,  the  more 
powerful  the  machine  becomes. 

Keyboards  and  character  sets 

Either  traditional  keyboard  or  typewriter  layouts  character  sets  can  be  upper  and  lower  case  and  special  characters 
can  sometimes  be  added. 

Storage  devices 

Cassette  tapes  hold  approximately  100,000  characters.  Diskettes  (floppy  discs)  usually  hold  250,000  characters 
upwards,  depending  on  packing  density  and  the  number  of  read-heads.  Disk  units,  with  a  wide  range  of  storage 
capacities  are  also  available. 

Display  units 

Typewriter  output  or  visual  display  units  of  varying  sizes  are  usual. 

Printers 

A  range  of  printers  is  available  with  extended  character  sets,  with  speeds  from  tens  of  characters  per  second  to 
hundreds  of  lines  per  minute.  It  is  also  possible  to  program  for  certain  special  characters  with  the  matrix  printers 
available. 

Communications 

Communications  facilities,  linking  devices  to  a  computer  are  available  so  that  devices  can  act  as  terminals  or  send 
batches  of  data  (remote  job  entry’)  to  a  central  computer. 


Video 

display 


Keyboard 


Storage: 
cassettes  or 
diskettes 


Fig.  1 1  Components  of  a  basic  key-to-storage  system 


4.4.3  Capabilities  of  Key-to-Storage  Devices 

Data  are  entered  using  the  keyboard  and  are  displayed  on  the  screen,  if  one  is  available.  If  the  operator  sees  an 
error  it  is  usually  possible  to  correct  it  immediately.  With  the  VDU  the  screen  can  be  arranged  in  such  a  way  that  data 
element  names  are  already  presented  to  the  operator.  In  some  systems,  label  or  tag  information  is  held  in  protected 
zones  on  the  screen  which  can  not  be  overwritten  by  the  operator.  A  furthci  advantage  of  this  technique  is  that  only  the 
data  needs  to  be  entered  and  not  the  tag  information.  Once  the  data  are  validated  it  is  possible  to  correct  them 
immediately  on  the  screen.  Following  this  step,  they  can  be  stored. 

Communication  with  a  central  computer  can  be  done  either  by  having  a  direct  communications  link  between  the 
key-to-storage  device  and  the  computer  or  by  using  a  compatible  medium.  Tor  instance,  data  are  often  transferred  to  half 
inch  magnetic  tapes  using  a  converter.  The  possibilities  depend  on  the  equipment  available,  e.g.,  the  Nixdorf  620/15  and 
620/45  (shown  in  Figure  1 2)  transfers  the  information  directly  to  half-inch  tape. 

The  larger  the  machine,  the  greater  is  the  range  of  operations  that  can  be  performed.  Figure  13  shows  the  elements 
of  a  multi-work  station  shared  processor  system.  Larger  processors  offer  more  possibilities  for  validation  and  record 
manipulation.  For  instance,  it  is  possible  to  have  stored  programs  and  use  them  to  operate  on  the  data  in  real  time  If 
compilers  are  available,  tailor-made  programs  can  be  written  for  particular  applications.  Shared  processors  offer  other 
advantages  over  single  units.  For  instance,  certain  equipment  (a  printer,  for  example)  need  only  be  obtained  once. 
However,  if  shared  units  fan.  the  whole  system  can  be  put  out  of  operation  and  hence  the  level  of  duplication  of  equip 
ment  (e.g.,  discs)  needs  to  be  thought  out  carefully. 

4.4.4  Comparison  of  Key-to-Storage  Devices  with  Punched  Tape  and  Cards 

Key-to-storage  devices  were  developed  to  replace  the  punched  card  and  paper  tape  input  systems.  Although  they 
are  more  expensive,  they  do  provide  a  higher  throughput  of  material  and  in  many  cases  an  increased  ‘operator  morale’ 

Data  can  be  entered  much  more  quickly  than  with  the  card  punch  t» cause  of  the  error  correction  and  verification  facilities 
and  because  the  number  of  characters  to  be  entered  can  be  smaller  as,  for  instance,  tag  information  can  be  pre  set  on  the 
screen.  Major  disadvantages  of  key  to-storage  devices  seem  to  be  the  cost,  if  a  more  sophisticated  system  with  printer  and 
VDU  are  acquired,  and  the  lack  of  hardcopy  oq  the  simpler  devices.  The  skill  of  the  operator  needs  to  be  greater  too. 

A  summary  of  the  advantages  and  disadvantages  appears  in  Table  4. 

4.5  On-Line  Input 
4. 5  1  General 

On-line  input  can  be  defined  as  the  input  of  data  directly  to  the  machine  in  which  they  are  stored.  Usual  equipment 
for  input  are  the  typewriter  type  terminal  and  visual  display  units  (VDU).  The  central  processor  is  used  for  data 
verification  and  editing.  There  is  a  wide  variety  of  terminals  available.  Axner  and  Reaghan53  indicate  some  170  products 
from  76  vendots  on  the  USA  market. 


The  major  advantage  of  the  technique  compared  with  key-to-storage  systems  is  the  fact  that  it  is  possible  for  the 
operator  to  interact  directly  with  the  information  files  stored  in  the  mainframe  computer.  T or  instance,  it  is  possible  to 
check  indexing  terms  against  a  thesaurus  immediately  the  data  arc  input.  Error  corrections  can  be  carried  out  quickly 


TABLE  4 


Some  Advantages  and  Disadvantages  of  Key-to-Storage  Devices 


Advantages 

Disadvantages 

IEgher  throughpia  c.f.  punched  card 

Immediate  validation  and  error 
correction 

Formatted  screen  facilities  - 
reduction  in  number  of  characters 
to  key 

Data  quickly  available  for  use 

Low  noise  level 

Easier  handling  of  physical  files 

Relatively  higher  equipment  cost 
but  shared  processors  reduce  cost 

No  immediate  hardcopy  on  simpler 
devices  with  VDUs 

Shared  processor  facilities  can 
mean  that  all  operators  are  idle 

Greater  operator  skill  required  c.f. 
punched  card 

On  the  other  hand,  it  would  be  necessary  to  bear  in  mind  the  costs  of  such  an  input  method:  for  instance,  the  overhead 
for  transaction  processing  in  the  computer  and  having  the  files  on-line  for  long  periods.  Key-to-storage  devices  can  also  be 
on-line,  allowing  data  to  be  entered  locally  and  partly  verified;  later  the  verification  can  be  completed  in  the  mainframe 
computer. 

4.5.2  Software 

Some  manufacturers  produce  general  purpose  software  for  input.  If  not,  or  if  special  facilities  are  to  be  provided 
e.g,  verification  of  the  correctness  of  indexing  terms,  software  will  have  to  be  specially  written.  The  IDRC10 
minicomputer-based  bibliographic  information  system  provides  such  facilities.  Appendix  A  shows  an  example  of  the 
dialogue  between  ihe  computer  and  the  operator.  Another  example  of  an  on-line  input  system  can  be  found  in  the 
GRIPS9  system.  The  facilities  of  this  software  package  for  the  Siemens  7000  series  are  shown  in  Figure  2.  The  central 
module  is  a  data  base  management  system  for  the  maintenance  of  data  bases.  The  input  module  (D1NUPS  -  D1MDI 
Input  and  Updating  Software)  allows  the  creation  of  a  pre-formatted  record  grid  for  various  applications.  An  example 
is  shown  in  Figure  14.  Data  can  be  enteied  into  the  data  base  and  edited  using  DINUPS.  The  advantage  of  the  on-line 
input  comes  when  the  retrieval  software  DIRS2  (DIMDI  Information  Retrieval  System  -  Version  2)  is  used.  It  is  possible 
to  use  the  DIRS2  software  immediately  data  are  entered  to  check  that  they  are  correct.  Thus  the  system  allows  a  full 
interaction  between  the  operator  entering  the  data  and  the  retrieval  software  by  first  using  DINUPS  and  then  DIRS2. 
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Fig.  14  An  example  of  a  pre-formatted  VDU  grid  for  the  DIMDI  DINUPS  input  system 
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4.6  Optical  Character  Recognition 

4.6.1  Description 

Optical  character  recognition  (OCR)  is  a  method  of  data  entry  which  converts  data  into  machine-readable  form 
without  the  need  to  use  a  keyboard.  Generally,  OCR  readers  arc  divided  into  the  following  three  categories: 

-  Optical  character  readers  (OCR) 

Machines  have  varying  levels  of  sophistication  but  the  best  ones  can  read  an  extended  character  set  ot  letters 
and  numbers  which  can  be  typed,  printed,  handwritten  or  output  from  a  computer. 

-  Optical  bar  code  readers 

These  machines  read  a  scries  of  lines  which  represent  a  number. 

Optical  mark  readers 

These  machines  read  marks  placed  in  predefined  areas  of  a  page. 

Optical  character  readers  arc  clearly  the  most  sophisticated  of  the  three  categories  of  machine.  Optical  bar  code 
readers  can  be  used,  for  instance  for  library  circulation  systems,  the  accession  number  and  reader  number  being 
represented  by  a  series  of  lines,  tire  former  in  the  book  and  the  latter  on  an  identity  card.  Optical  mark  readers  recognise 
the  presence  or  absence  of  a  mark  at  a  predefined  point  on  a  page.  They  are  frequently  used  for  survey  work,  question¬ 
naire  processing  etc.  Auerbach*  contains  a  use  Ail  review  of  OCR  equipment. 

4.6.2  Clmi\h'tcr  Sets 

Optical  character  recognition  equipment  can  be  made  to  read  typescript,  printed  pages,  handwritten  material, 
computer  output  and  computer  output  microfilm.  'Hie  typescript  usually  lias  to  be  in  one  or  more  specified  formats. 
Those  recommended  by  the  International  Organization  for  Standardization  are  called  OCR-A  and  OCR-BS4,J5.  The 
character  set  for  the  latter  is  shown  in  Figure  IS.  Such  fonts  can  be  obtained  on  exchangeable  writing  heads,  for  instance 
as  used  on  the  IBM  typewriter. 

A  number  of  other  fonts  are  in  use  such  as  the  IBM  1403  which  is  that  used  ou  the  1403  line  printer.  3/ 1 6"  Gothic 
is  also  a  font  used  for  optical  character  recognition.  At  1NSPFC  (The  Institution  of  Electrical  H,  ..inccra)  a  COMPUSCAN 
OCR  equipment  is  used  with  the  standard  COMBUSCAN  character  set.  An  example  is  shown  in  Figure  1 6.  Some 
machines  such  as  the  IBM  3SSb  are  capable  of  reading  handwritten  material. 

4.6  J  Make-up  of  an  OCR  Reader 

An  OCR  reader  basically  comprises: 

an  input  mechanism, 
a  scanning  and  recognition  mechanism, 
an  output  mechanism  (for  the  processed  documents), 
a  controlling  mechanism, 

a  link  to  a  data  recording  device  or  directly  to  a  computer. 

An  example  is  shown  in  Figure  1 7.  The  input  mechanism  depends  on  the  form  of  the  input  which  can  be: 

journal  tapes  and  strips  (for  example,  a  cash  register), 
document  readers  which  read  lines  or  sections  on  a  page, 
page  readers  which  read  multiple  lines  of  type  on  a  page, 
hand-held  wand  readers. 

The  scanner  and  recognition  subsystems  read  the  characters  and  convert  them  into  electrical  signals  which  are  then 
analysed. 

The  Control  Data  Corporation  model  92650  is  an  example  of  an  OCR  page  reader.  The  macine  can  adapt  to  a 
variety  of  applications  such  as  input  to  an  intelligent  terminal,  remote  input  device  to  a  central  computer,  multi-media 
(OCR  key-to-disc  system),  standalone  OCR  data  entry  system  and  input  device  to  a  communications  system.  OCR-A 
and  OCR-B  fonts  are  available.  Instantaneous  read  rate  is  683  characters  per  second  throughout  or  one  t\dl  page  every 
12  seconds.  The  Siemens  HL5  hand-held  OCR  scanner  is  an  example  of  a  portable  OCR  scanner  which  might  he  used  loi 
a  library  circulation  system.  It  recognises  a  number  of  standard  alphanumeric  fonts. 


4.6.4  Feasibility  of  the  OCR  Solution 

OCR  clearly  lias  tin  advantage  over  other  methods  of  input  when  original  forms  can  be  used,  for  in  tlii«  case  it  is 
not  necessary  to  re-type  (lie  data.  Hence,  OCR  has  clear  advantages  for  documents  which  are  produced  by  (lie  computer 
(such  as  bills  or  orders)  and  which  have  to  be  further  processed  after  data  bare  been  added 


Y.4.1  Illustration  of  OCR-B  dimeters  (complete  set  according  to  ISO  Recommendation  R  1073)  scale  S:  I. 


Y.4.J.I  Characters  ofOCR'B  having  a  strokewidlh  of  0.35  mm  (0.014  in)  for  sires  I  and  II  and  of  0.38  mm 
(0.01  S  in)  for  size  III. 
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V.4.2.2  Character  of  OCR-B  having  a  strokewidth  of  0.3 1  mm  (0.01 2  in)  for  sizes  I  and  I!  and  of  0.3 4  mm 
(0.013  in)  for  size  111,  which  are  considered  in  clause  4.5. 

£$  :  ;  <%>?[!  #&a] 

(  =  )  _ 

Y.4.J.1  Characters  of  OCR-M  having  a  stroke widlh  of  0.31  mr.i  (001 2  in)  for  sires  I  and  II  and  of  0.34  nun 
(0.013  in)  for  size  III,  which  are  considered  in  clause  Y.2.9. 
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Fig.  1 5  The  OCR-B  character  set  of  the  International  Organization  for  Standardization  (ISO) 
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Fig.  1 6  Input  of  bibliographic  information  to  the  COMPUSCAN  OCR  reader  (UK  ihSl’FC) 


Fig.  1 7  Clements  of  an  OCR  (Optical  Character  Recognition)  reader 
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As  a  straightforward  replacement  to  the  key  punch  or  key  to-magnetic  storage  device,  the  advantages  of  OCR  are  less 
clear,  if  a  data  entry  form  has  to  be  specially  created.  The  relatively  high  cost  of  OCR  equipment  means  that  a  high 
volume  of  input  is  necessary  to  justify  the  machine  unless  it  is  shared  with  other  users. 

In  choosing  an  OCR  machine,  the  needs  of  the  applications  must  be  balanced  against  the  level  of  sophistication 
offered  by  the  equipment.  Benchmark  tests  can  be  devised  to  compare  a  machine  with  its  rivals.  Factors  to  look  out  for 
include: 

-  error  rates  and  reject  rates, 

-  throughput, 

-  number  of  different  fonts  read, 

-  input  formats  (page,  document,  journal). 

The  use  of  normal  typewriters  is  a  clear  advantage  over  other  input  equipment  for  operators  do  not  have  to  be  specially 
trained.  OCR  equipment  is  sometimes  linked  with  key  to-disc  equipment  or  a  visual  display  terminal  linked  to  a  main 
frame  computer.  This  allows  data  to  be  validated  and  edited  on-line. 

4.7  Choosing  Input  Equipment 

Often  it  is  necessary  for  budgetary  reasons  to  make  use  of  equipment  already  available  in  an  organisation  (for 
example,  a  card  punch  shared  with  other  applications,  an  existing  visual  display  terminal  or  a  central  data  processing 
department).  Tor  those  who  have  to  choose  new  equipment,  however,  there  are  a  number  of  criteria  which  will  limit  the 
choice  and  some  of  these,  are  given  in  Table  5.  General  advice  on  project  management  is  given  in  Chapter  six.  The  first 
step  to  undertake  is  the  definition  of  the  needs.  These  can  be  obtained  from  a  functional  description  of  each  system  for 
which  data  are  needed.  This  can  then  be  used  to  develop  a  functional  specification  of  the  input  needs,  e.g.,  volume  of 
data,  speed  of  updating  required,  etc. 


TABLE  5 

Steps  to  be  Taken  in  the  Choice  of  Input  Equipment 


1 .  Define  what  the  needs  are: 

-  details  of  input  systems  forms,  data  elements  etc.  (undertake  a  functional  analysis) 

-  specify  the  character  set 

constraints  (e.g.  computer  to  be  used,  communications  facilities  and  peripherals 
available) 

-  volume  of  data 

-  time  constraints 

-  geograpliical  constraints 

2.  Examine  available  equipment  which  would  fulfil  needs: 

-  reputation  of  manufacturer 

-  number  of  installations  (in  your  area) 

-  size  of  company 

-  delivery  dates 

-  documentation 

-  user  support  (maintenance) 

-  price  and  method  of  payment  (purchase,  lease. .) 

-  use  of  a  service  bureau 

3.  Note  details  of  each  system: 

-  hardware  specification 

-  software  facilities 

-  upgrading  or  downgrading  options  -  ease  of  transfer  from  one  to  the  other 

-  diameter  set 

-  is  output  compatible  with  the  mainframe  computer? 

-  size  of  machine 

4.  Look  at  cost/perfonnance  estimates  for  each  system 

5.  Choose  machine 


An  examination  of  the  equipment  available  ean  then  be  made,  cspceially  a  comparison  between  the  facilities  offered 
and  those  required.  Information  on  manufacturers'  products  can  be  obtained  f.om  the  manufacturer,  vendors,  equipment 
reviews,  etc.  Auerbach  Computer  Technology  Reports*  contain  descriptions  of  a. a J able  products  and  discuss  selection 
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criteria.  There  are  a  number  of  selection  criteria  which  will  depend  on  the  weight  attached  to  cadi  point.  Tor  instance,  it 
is  pointless  installing  a  sophisticated  niadiinc  if  the  nearest  service  agent  could  not  come  fot  one  week  following  a  break 
down.  Likewise,  a  reputable  company  which  is  likely  to  stay  in  business  is  important. 

The  reputation  of  the  manufacturer  is  a  good  guide  and  the  number  of  active  installations  is  usually  proof  of  success. 
It  is  always  possible  to  contact  other  users  and  ask  them  their  opinion  of  the  equipment,  a  good  manufacturer  will 
provide  a  list.  User  support  for  training  courses  and  documentation  is  important,  especially  if  the  equipment  is  more 
sophisticated.  The  use  of  a  service  bureau  will  also  need  to  be  investigated  especially  for  OCR  machines. 

The  method  of  pay  ment  will  need  to  be  considered  carefully.  Outright  payment  may  be  cheaper  but  maintenance 
might  not  be  so  good,  there  is  less  of  an  incentive  to  keep  the  machine  running  than  if  lea-.ng  or  rental  arc  opted  for. 
Short  leases  give  more  flexibility  and  the  opportunity  to  change  to  a  more  modern  machine.  Longer  leases  are  usually 
cheaper  per  month.  Different  machines  can  be  compared  for  cost  against  performance.  Staff  morale  is  another  factor, 
key  to-disc  machines  may  give  more  job  satisfaction  than  the  punched  card  machine,  but  costs  may  be  higher. 


5.  INFORMATION  STORAGE  DEVICES 

5.1  Introduction 

Computer  systems  have  a  number  of  storage  devices  which  have  differing  lev  els  of  performance,  cost  and  storage 
techniques.  Storage  devices  are  characterized  by: 

-  the  cost  per  byte  of  storage, 

-  the  size  of  the  store, 

-  the  access  time,  the  time  to  retrieve  the  data, 

-  the  data  transfer  rate. 

The  major  types  of  memory  in  use  are: 

main  memory',  magnetic  drums,  magnetic  discs,  magnetic  tapes,  mass  storage  devices, 
holographic  and  photographic  memory. 

The  different  storage  media,  because  of  the  wide  range  in  cost,  capacity  and  performance,  are  used  for  different  purposes. 

5.2  Main  Memory 

Main  memory  usually  has  an  access  time  of  less  than  a  microsecond,  the  speed  depending  on  the  technology  used. 
The  most  well  known  is  the  ferrite  core  which  is  a  core  of  magnetic  material  which  can  be  magnetized  in  one  of  two  direc¬ 
tions.  There  arc  other  ways  of  employing  magnetic  material,  such  as  thin  films  and  plated  wires.  Magnetic  storage  is 
rapidly  being  overtaken  by  semiconductor  storage.  Using  the  techniques  of  large  scale  integration  tLSl)  and  very-large- 
scale  integration  (VLSI)  a  large  number  of  memory  circuits  can  be  built  into  a  single  'chip*  of  silicon.  Mam  memory  sizes 
up  to  64K  bits  on  a  single  chip  arc  now  being  produced  and  vised  in  several  computer  systems. 

The  physical  size  of  main  memory  is  usually  up  to  several  million  bytes  (megabyte)  with  a  purchase  cost  of  several 
tens  of  thousands  of  dollars  per  megabyte,  it  is  used  for  the  storage  of  programs  which  are  currently  being  executed  and 
for  data  undergoing  processing.  Tnc  operating  system  is  also  stored  here.  Many  operating  systems  can  execute  a  nun'  er 
of  programs  at  once,  using  techniques  called  multiprogramming  and  time  sharing.  These  progiains  arc  stored  in  diffcrciu 
parts  of  the  main  memory .  Size  of  the  main  memory  plays  a  large  part  in  determining  the  throughput  of  the  computer 
e.g  because  more  programs  can  be  executed  concurrently  and  larger  and  more  sophisticated  operating  systems  can  be 
employed.  A  number  of  independent  manufacturers  arc  producing  'addon*  memories  which  can  be  added  to  existing 
machines.  Sometimes  prices  compare  favourably  with  those  of  the  computer  manufacturer. 

5.3  Direct-access  Devices 

Magnetic  drums  and  discs  arc  called  direct  access  devices  because  of  the  ability  to  access  data  directly  rather  than 
sequentially  as  when  using  a  magnetic  tape.  In  fact  the  speed  of  access  of  data  is  significantly  slower  than  for  main 
memory. 

The  fastest  of  these  devices  is  the  inagnelL  Jrum  (sec  Figure  18)  which  has  a  cylindrical  shape  and  rotates  at  several 
thousand  revolutions  per  minute.  The  cylinder  is  coated  with  iron  oxide  and  a  series  of  heads  which  write  and  read 
information,  are  placed  along  the  drum.  These  heads  do  not  move  and  hence  these  devi.  es  arc  sometimes  called  hea.l- 
per  track’  devices  because  data  arc  recorded  on  a  band  (‘track*)  of  the  drum  covered  by  a  single  head.  Access  time  is 
normally  a  few  milliseconds  and  transfer  rates  arc  greate,  than  1  million  bytes  per  second.  Storage  capacity  is  ty pically 
in  the  tens  of  megabytes  range. 

Many  computers  use  virtual  memory’  systems  which  n.  effect  expand  the  size  of  the  main  memory.  In  practice, 
main  memory  is  split  up  into  sections,  often  called  pages  and  these  are  moved  from  main  memory  to  drum  storage  when 
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Fig.  1 8  Outline  of  a  magnetic  drum 


they  are  not  immediately  required.  Drums  are  also  used  for  files  of  information  which  are  in  frequent  use,  the  keeping  of 
which  in  main  memory  is  not  justified. 

Magnetic  disc  devices  consist  of  a  series  of  flat  circular  platters  which  are  held  horizontally,  are  stacked  one  alxve 
the  next  and  which  are  rotated  about  a  common  vertical  axis.  Each  platter  is  coated  with  magnetic  matenal.  There  a-e 
two  methods  of  reading  and  writing,  with  fixed  or  with  moveable  heads.  In  the  fixed  head  device  the  heads  are  arrange  1  as 
shown  in  Figure  19.  As  with  the  drum,  there  ,’s  one  head  per  track  and  hence  the  performance  of  these  devices  is  similar. 
Moving  head  discs  have  one  read  head  per  platter  side  (see  Figure  20).  The  read  heads  are  attached  to  a  series  of  arms 
which  are  connected  together  to  form  a  comb.  To  read  or  write  to  a  particular  track,  the  comb  has  first  to  be  positioned 
on  the  appropriate  part  of  the  disc.  All  the  read/write  heads  move  together  so  data  are  usually  written  in  ‘cylinders'  le. 
all  the  tracks  which  can  be  written  without  moving  the  heads. 
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i 


Fig.l  9  Section  through  a  fixed-head  magnetic  disc  storage  unit 


Access  time  for  discs  is  slower  than  for  drums,  this  time  is  dependent  on  the  seek  time  (the  time  to  find  a  particular 
track)  and  the  rotational  delay.  Some  discs  can  be  removed  from  the  drive,  increasing  the  storage  capacity  of  the  machine. 
Capacities  vary  from  several  megabytes  (single  platter  cartridge)  up  to  several  hundred  megabytes  (.multiple  head,  multiple 
platter).  Average  access  times  of  around  20  msec,  a  transfer  rate  of  1.2  megabytes  per  second  and  a  capacity  of  635 
million  bytes  are  available. 

Even  smaller  capacities  are  available  on  the  flexible  discs,  sometimes  called  floppy  discs  or  diskettes.  The  flexible 
disc  is  a  circular  piece  of  polyester  film  of  about  5  inches  (or  larger)  which  is  coated  in  magnetic  iron  oxide  and  covered  in 
an  outer  plastic  or  paper  jacket.  Capacities  depend  on  recording  density  and  whether  both  sides  of  the  disc  are  utilised, 
but  over  1  megabyte  can  be  stored  on  a  single  disc. 

Discs  play  an  important  part  in  the  operation  of  time  sharing  and  multiprocessing  systems  by  providing  fast  access 
to  larger  amounts  of  data.  They  are  used  in  library  and  information  centres  for  storing  large  files  of  information.  For 
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Fig.  20  Section  through  a  moving-head  magnetic  disc  storage  unit 


instance,  millions  of  bibliographic  references  are  available  on-line  from  data  base  services.  Because  of  the  ability  to 
retrieve,  information  quickly,  a  customer  can  for  instance,  obtain  bibliographic  details  of  articles  of  interest  within  a  few 
seconds.  Costs  are  kept  competitive  because  many  users  can  be  served  simultaneously.  The  disc  storage  facilities  avail¬ 
able  on  mini-computers  are  also  improving  and  are  now  extending  the  possibilities  of  on-line  operation  for  smaller 
organizations. 

5.4  Magnetic  Tape 

Magnetic  tape  is  a  recording  medium  known  to  most  people  because  of  its  use  for  recording  music.  In  the  computer 
world  the  54  inch  9-track  tape  has  become  standard.  The  normal  2400  ft  tape  can  store  over  40  million  bytes  when 
recorded  at  a  density  of  1600  bytes  per  inch.  Tapes  of  shorter  length  are  available,  as  are  cassette  tapes  which  are 
frequently  used  for  input  of  data.  The  high  capacity,  low  cost  and  reasonably  high  data  transfer  rates  make  magnetic 
tape  a  valuable  medium  for  archival  storage  ard  for  storing  files  of  information  which  have  to  be  accessed  sequentially. 
However,  access  time  is  very  high  because  of  the  time  taken  to  mount  the  tape  in  the  first  place  and  because  the  tape  has 
to  be  unwound  to  reach  the  required  part. 

5.5  Mass  Storage  Devices 

The  mass  storage  device  provides  an  answer  to  the  problem  of  handling  magnetic  tapes.  The  equipment  consists  of 
an  array  of  magnetic  cartridges  housed  in  a  honeycomb  shaped  store.  The  capacity  of  the  Control  Data  Corporation 
cartridge  is  8  million  bytes.  When  a  particular  cartridge  is  requested,  an  electromechanical  device  retrieves  the  requested 
cartridge  and  transfers  the  data  to  a  disc.  Storage  capacities  of  the  IBM  3850  and  CDC  98500  systems  are  high;  IBM 
claim  between  35  billion  and  472  billion  bytes  and  the  average  access  time  is  seconds.  CDC  claim  a  3.S  second  average 
access  time. 

A  somewhat  similar,  but  now  largely  outmoded  device  is  the  data  cell  which  retrieves  the  selected  strip  of  flexible 
magnetized  material  and  fastens  it  to  a  rotating  drum. 

5.6  Photocopy  and  Laser-Holographic  Storage 

Large-capacity,  direct-access  storage  devices  exist  which  use  laser  technology.36  Information  is  written  onto  photo¬ 
graphic  film  or  a  holographic  substrate  using  a  laser.  Hence,  the  store  comprises  a  magazine  of  holographs  or  films  which 
are  retrieved  in  a  fashion  similar  to  that  of  the  data  cell. 


5.7  Newer  Memory  Types 

Other  types  of  storage  devices  are  coming  onto  the  market.  A  charge-coupled  device  (CCD)  was  recently  announced27 
and  is  intended  to  be  a  replacement  for  magnetic  drums.  CCD  and  bubble  memories  arc  designed  to  fill  the  gap  between 
main  memory  and  direct  access  storage,  providing  access  to  data  in  the  millisecond  to  microsecond  range. 

Storage  is  likely  to  decrease  in  cost  and  increase  in  performance  in  the  next  few  years.  Discs  with  over  1000  mega¬ 
bytes  capacity  are  likely  to  be  available  with  transfer  rate  of  several  megabytes  per  second  and  access  time  less  than  20 
milliseconds.  Costs  of  storage  may  well  be  less  than  $500/megabytes/month  for  main  memory,  bubble  or  similar  less 
than  $50,  discs  S 1 ,  mass  storage  less  than  one  cent  (USA  Currency). 
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6.  THE  MANAGEMENT  OF  PROJECTS 

6.1  The  Project  Environment 

The  introduction  of  a  new  system,  be  it  an  input  or  storage  system  described  above  or  another  type  of  system, 
usually  involves  changes  within  the  organization.  The  introduction  of  change  can  be  difficult  and  costly  unless  the  change 
is  carefully  planned  and  executed.  Many  organizations  introduce  new  systems,  using  the  concept  of  a  'project*.  A  project 
can  be  defined  as  a  temporary  organization  of  resources  to  achieve  a  particular  goal.  The  project  usually  has  a  leader  or 
manager  whose  task  it  is  to  see  that  the  project  is  carried  out  successfully  and  within  the  budget. 

An  important  aspect  of  a  project  is  the  project  plan,  which  will  usually  be  devised  by  the  project  manager. 

Projects  are  often  split  up  into  phases  (see  paragraph  6.2)  and  at  the  end  of  one  phase,  a  detailed  plan  for  the  succeeding 
phase  is  drawn  up.  An  overall  plan,  produced  at  the  beginning  of  the  project,  should  be  updated  at  the  end  of  each  phase. 
Plans  should  include  details  of: 

-  project  costs  (including  cost/benefits  analysis), 

-  timescale  (schedules  for  completion  of  project,  milestones,  etc.,), 

-  manpower  requirements, 

-  equipment  requirements. 

A  specification  of  the  work  to  be  done  should  be  drawn  up. 

The  planning  documents  together  with  the  results  of  a  proceeding  step,  allow  a  decision  to  be  made  on  whether  to 
go  on  to  the  next  phase  of  the  project.  If  any  decisions  on  alternative  paths  have  to  be  made,  management  is  able  to 
make  a  decision  with  the  maximum  information  available,  including  detailed  information  on  costs.  In  any  case,  the 
decision  to  go  ahead  is  only  to  the  end  of  the  next  step. 


6.2  The  Systems  Approach 


6.2. 1  The  Stepwise  Approach  to  System  Implementation 


It  is  now  recognized  that  new  systems  can  be  efficiently  introduced  or  old  ones  changed  using  a  stepwise  approach 
which  maximizes  the  chances  of  implementing  the  correct  system,  allows  management  to  make  lnforme'1  policy  decisions 
and  minimizes  the  effort  spent  if  changes  are  made  or  if  the  project  is  cancelled.  For  information  system  projects  it  is 
usual  to  consider  the  following  consecutive  steps: 


Conceptual  stage: 
Initial  stage: 


Systems  analysis  and  definition: 
Design: 

Construction  and  installation: 
Evaluation: 


the  project  is  considered  for  adoption 

a  minimum  of  effort  is  expended  to  provide  enough 
information  to  decide  whether  it  is  worthwhile  preceding 
with  the  project 

the  customer  and  the  organization  carrying  out  the  project 
agree  on  what  the  system  should  do 

the  system  is  designed  i.e.  a  detailed  description  of  how 
the  system  is  to  be  constructed  is  drawn  up 

the  system  is  built  and  installed 

a  periodic  review  to  sec  whether  the  system  is  performing 
according  to  specification  and  desires 


Three  of  these  steps  i  e  systems  analysis  and  definition,  design,  and  construction  and  installation  are  briefly  described 
below.  For  specific  details,  the  reader  is  advised  to  turn  to  standard  texts  (see  list  of  references) 


6.2.2  Systems  Analysis  and  Definition 

The  systems  analysis  and  definition  phase  of  a  project  is  perhaps  the  most  important  of  all  the  steps  outlined  above. 
It  is  at  this  stage  that  the  needs  are  specified  in  detail.  It  therefore  offers  an  opportunity  for  management  to  look  at  the 
goals  of  the  organization.  With  the  introduction  of  a  new  system,  it  is  possible  to  cither  carry  out  the  present  services 
more  efficiently  or  introduce  improved  services  instead.  For  instance,  a  computer-based  system  could  offer  an  indivi¬ 
dualized  selective  dissemination  of  information  service  (SDI),  if  this  were  not  already  being  offered,  in  addition  to 
production  of  a  current-awareness  bulletin. 

The  tasks  required  to  be  carried  out  fit  this  stage  of  the  project  include: 

-  an  examination  of  objectives  of  the  existing  and  the  proposed  system, 

-  the  degree  to  which  current  objectives  are  being  met, 

-  a  revision  of  objectives  if  necessary, 

-  an  examination  of  the  needs  of  the  customer  (department  which  is  paying  for  the  system)  and  the  the  user, 

-  the  preparation  of  a  functional  specification  which  describes  what  the  new  system  should  do. 
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The  functional  specification  or  system  definition  serves  as  a  form  of  agreement  between  the  organization  carrying  out  the 
work  amt  five  customer.  It  should  include: 

a  general  description  of  the  system  including  objectives  and  functions, 
a  cost  benefit  discussion, 

details  of  information  flow  and  data:  input  data  (including  source  and  method  of  creation),  processing  needs, 
output  requirements  (including  distribution), 
audit  controls, 
time  constraints, 

’people'  factors  (e.g.,  definition  of  responsibilities), 
contents  of  files  and  updating  details. 

A  plan  for  the  design  phase  should  be  drawn  up  and  the  overall  project  plan  updated, 
ft  2J  Systems  Design 

The  main  objective  of  five  design  phase  is  to  produce  a  detailed  description  of  the  system  to  be  implemented. 
Individual  details  of  the  system  have  to  be  specified,  e.g.,  program  specifications,  files,  details  of  output  and  input  forms, 
hardware  and  special  software  to  be  used  and  information  flow.  An  important  part  o’,  this  stage  is  the  system  flowchart 
which  describes  the  flow  of  data  through  five  system.  Some  organizations  arc  now  using  the  HIPO  (Hierarchical  Input 
Processing  Output)  technique30  instead  of  a  flowchart.  To  back  up  the  flowchart,  a  description  of  each  operation  is 
drawn  up. 

The  work  to  be  done  at  this  stage  will  depend  on  the  strategy  adopted  for  example,  whether  software  can  be 
acquired  or  programming  needs  to  he  done,  whether  hardware  is  already  available,  and  the  complexity  of  the  system. 
Grosclv28  considers  tlvat  before  any  software  or  equipment  is  selected,  a  broad  definition  of  the  application  and  its 
environment  should  be  made,  including: 

explanations  of  tasks  to  be  performed, 
definition  of  the  specific  data  to  he  used, 
response  requirements  of  the  system, 
the  size  of  the  data  base  and  growth  rate, 
the  number  and  geographical  location  of  terminals. 

The  availability  of  suitable  software  is  more  and  more  determining  tlve  choice  of  machine.  Software  has  an  even 
greater  impact  if  a  dedicated  computer  is  to  be  acquired.  Some  of  tlve  factors  to  be  taken  into  account  in  choosing  soft¬ 
ware  are  discussed  in  Chapter  two. 

Points  to  oe  considered  in  the  design  include; 

details  of  output  required  (e.g.,  formats,  quantity,  distribution), 

details  of  data  elements, 

processing  and  logical  operations  (system  logic), 

audit  and  control  factors. 

details  of  input  (e.g..  formats,  sources  of  data), 

details  of  files,  storage  devices,  updating  factors,  data  volume,  response  times, 
specification  of  computer  programs, 
specification  of  manual  procedures. 

It  is  important  to  consider  the  human  ivvterfaces,  especially  the  problems  of  system  operation  and  user  interaction. 

The  design  specification  should  he  included  in  a  report  which  will  also  contain  a  plan  for  five  following  stage  i.e. 
implementation.  The  master  plan  for  the  project  should  also  be  brought  up  to  date. 


o  JT  Construction  oiu!  Insuillotion 

During  this  phase  the  system  is  built  and  commissioned,  five  documents  used  to  effect  this  implementation  are  the 
results  of  the  previous  phases,  especially  the  desig  v  .pecification  and  the  functional  specification.  Tasks  to  be  performed 
include: 

writing  computer  programs, 

file  creation, 

documentation, 

acquisition  of  equipment, 

training  in  five  use  and  operation  of  the  system, 

phase  in.  comprehensive  testing  and  proving  five  system. 

It  is  likely  that  this  stage  will  he  the  most  difficult  from  the  planning  point  of  view.  Hence  the  plan  produced  at  five  end 
of  the  design  stage  should  be  detailed  and  at  the  same  time  flexible.  The  implementation  period  is  also  critical  for  deter 
mining  whether  the  system  meets  the  requirements.  Comprehensive  and  thorough  testing,  operating  the  new  system  in 
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parallel  with  the  old,  user  training  and  systems  evaluation  techniques  are  some  of  the  activities  required  at  this  stage  in 
order  to  determine  performance  and  to  ensure  satisfactory  operation  of  the  new  system. 


6.3  Personnel 

Tire  personnel  needed  for  the  introduction  of  a  computer  application  are  usually  specialists  in  a  particular  area 
(e.g.  systems  analysts,  programmers).  If  the  necessary  people  are  available  within  the  organization,  this  represents  the 
best  solution  because  they  are  likely  to  understand  the  particular  needs  of  the  organization  and  the  expertise  gained  will 
not  be  lost  at  the  end  of  the  project.  If  not,  there  are  other  possibilities: 

-  staff  can  be  hired  from  consultancy  organizations  and  software  houses, 

-  the  whole  or  parts  of  the  project  could  be  contracted  out  to  another  organization. 

The  major  difference  between  the  two  approaches  is  that  in  the  former,  manpower  can  be  hired  and  fitted  into  a  project 
team.  This  is  likely  to  be  the  best  approach  if  project  managers  with  the  correct  background  are  available  in-house, 
because  staff  can  then  keep  control  of  the  project  and  can  learn  from  the  temporary  hired  personnel. 


It  will  be  necessary  to  decide  on  the  degree  of  involvement  the  in-house  staff  will  have  after  the  system  is  installed, 
for  instance,  who  will  maintain  -he  system.  The  type  of  personnel  likely  to  be  needed  for  systems  development  are: 


Project  Leader: 
Systems  analyst: 

System  designers: 

Programmers: 

Computer  operators: 
Data  base  manager: 


Management  capability,  understanding  of  application  and  of 
computer  systems 

Experience  of  techniques  of  systems  analysis  plus  understanding 
of  the  application  and  the  computer  systems,  knowledge  of 
existing  similar  applications 

knowledge  of  software,  programming  techniques,  hardware, 
communications 

knowledge  of  programming,  data  base  management  systems 
(if  used) 

ability  to  operate  the  system 

probably  a  librarian  or  information  specialist  with  knowledge  of 
the  computer  and  data  base  management  systems  who  is  in  charge 
of  the  data  bases  in  the  computer 


The  people  who  will  operate  and  use  the  system  will  also  be  involved  in  the  systems  development.  It  is  sometimes 
possible  to  obtain  people  who  can  undertake  two  or  more  of  the  above  functions,  for  instance,  systems  designers  are 
sometimes  used  for  programming.  This  is  a  developing  trend  as  data  base  software  becomes  more  specialized. 


It  is  important  that  the  computer  system,  once  operational,  is  not  dependent  on  one  person.  How  often  are 
programs  understood  only  by  the  person  who  wrote  them?  It  is  not  a  good  practice  tc  let  this  happen.  The  distribution 
of  tasks  among  a  group  of  people  and  the  provision  of  good  documentation  are  essential  and  in  any  case  lead  to  better 
job  satisfaction  and  staff  flexibility. 


6.4  Other  Project  Management  Considerations 

The  above  treatment  has  been  necessarily  brief  and  there  are  a  number  of  salient  points  in  addition  to  those 
discussed  above,  which  should  be  borne  in  mind  when  embarking  on  an  information  processing  project.  Some  of  these 
points  are  listed  below,  but  the  subject  is  covered  in  greater  depth  in  relevant  works  cited  in  the  list  of  references, 
especially  by  Gilderslcevc31,  Cleland35  and  Bingham33.  In  particular: 

-  the  project  environment  should  be  encouraged; 

-  the  project  leader  should  be  given  the  necessary  power  and  support  to  carry  out  a  project  and  the  job  should 
carry  the  requisite  rewards; 

-  the  customer  (the  department  requesting  the  work  done)  should  pay  for  the  system; 

-  the  individual  chronological  steps  of  the  project  (outlined  in  paragraph  6.2)  should  be  kept  separate  and  cutting 
comers  should  be  avoided; 

-  planning  documents  should  be  produced  before  considering  the  next  step; 

-  a  regular  reporting  procedure  should  be  introduced; 

-  the  project  leader  should  keep  all  interested  parties  informed; 

-  the  specification  of  what  is  required  should  be  frozen  and  agreed  to  in  writing; 

-  high  level  management  should  be  involved  in  the  analysis; 

-  a  standard  for  documentation  and  methods  of  work  should  be  established  and  maintained  (e.g.  structured 
programming  techniques); 

-  check  points  (dates)  for  regular  reviews  of  the  project  should  be  introduced; 

-  project  members  should  be  motivated  by  assigning  them  tasks  which  are  not  too  difficult,  but  which  will  develop 
them,  giving  them  as  much  independence  as  possible  and  producing  written  appraisals  of  their  performance; 


34 


panic  situations  should  be  avoided; 

project  members  should  work  physically  near  to  each  other; 

each  team  member  should  have  an  agreed  job  title  and  job  description. 
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6.5  Contract  Problems 

If  work  is  contracted  out,  normal  contract  practices  should  be  adopted  but  the  following  discussion  serves  to  high¬ 
light  certain  important  points.  A  detailed  description  of  the  work  to  be  done  and  a  cost  estimate  should  be  developed 
before  contacting  potential  companies.  It  is  useful  to  break  down  the  project  into  a  number  of  tasks  for  which  man¬ 
power,  equipment,  material  and  overhead  costs  can  be  calculated.  Allowances  should  be  made  for  human  underestima¬ 
tion  of  the  duration  of  a  task.  A  contingency  factor  for  each  task  should  also  be  added.  It  is  sometimes  useful  to 
compare  cost  estimates  with  the  money  spent  on  previous  similar  projects.  For  preparation  of  ’calls  for  tender’,  ’requests 
for  proposals’  or  contracts,  the  technical  specification  should  be  as  detailed  as  possible,  unambiguous  and  should  be 
written  by  an  expert  and  agreed  on  by  interested  parties  within  the  organization  before  sending  them  out.  ‘Requests  for 
proposals’  are  often  used  when  the  task  cannot  be  well  defined.  The  end  product  is  often  a  report  and  hence  they  are 
best  suited  for  the  initial  stages  of  a  project.  The  exact  details  which  form  the  content  of  a  subsequent  contract  are 
usually  negotiated,  following  an  initial  selection  procedure.  Calls  for  tender  are  usually  more  specific  and  hence  the 
description  of  the  requirements  must  be  very  specific,  too.  An  organization  which  bids,  does  so  knowing  that  a  price  is 
attached  to  the  bid  which  if  accepted,  is  usually  adhered  to  unless  later  negotiations  change  the  description  of  require¬ 
ments. 

Contractors  can  be  selected  on  the  basis  of: 

-  the  least  expensive  tender, 

-  the  organization  most  likely  to  provide  the  desired  product  as  specified  in  the  contract, 
the  reputation  of  the  tendering  organization, 

-  previous  experience  in  the  area, 

-  stability  of  the  company, 

-  service  facilities  in  the  geographic  area. 

In  order  to  establish  the  capabilities  of  the  company  the  following  can  be  examined: 

-  the  management  organization, 

-  the  organizational  procedures  used  throughout  the  company, 

-  standards  used  by  the  company  to  carry  out  work, 

-  controls  and  checks  of  work  normally  adopted, 

-  use  of  modem  techniques  such  as  structured  programming,  HIPO  charts  etc.. 

-  qualifications  and  experience  of  staff  (specify  the  staff  who  will  work  on  the  pr  jeer  if  feasible), 

-  examination  of  previous  work. 

A  site  visit  to  the  company  can  be  made,  especially  an  unexpected  one,  to  ir.spect  the  organization  and  the  quality  of  the 
staff.  For  the  evaluation  of  offers,  the  following  should  also  be  considered  i.>  addition  to  the  companies  capabilities: 

-  does  the  offer  satisfy  all  the  requirements? 

-  have  the  company  deliberately  underbid? 

-  summarize  each  offer  to  facilitate  comparison, 

-  arc  the  time  constraints  feasible? 

Bids  should  be  evaluated  by  a  group  of  people  (review  pane!)  who  understand  the  field  of  application.  Controls  should 
be  built  into  the  specification  of  a  contract  e.g.  progress  meetings,  inspections,  interim  reports  etc.  Other  factors  to  be 
specified  include: 

subcontracting  rules, 

testing  criteria  for  product  (benchmarks  etc.), 

-  inspections  or  audits  to  be  made  at  the  suppliers  works  (if  necessary). 

Once  a  contractor  has  been  selected,  a  single  person  should  be  nominated  to  communicate  with  the  contractor.  (It  is 
usual  to  insist  that  the  contractor  do  the  same.)  This  person  will  control  the  execution  of  the  contract  through. 

-  regular  progress  meetings, 

keeping  detailed  documentation  on  the  progress  of  the  project, 
using  an  agreed  ‘change  notification  procedure’, 
ensuring  interim  payments  are  only  made  following  satisfactory  progress, 
carrying  out  inspections  etc., 

ensuring  that  the  product  as  delivered  is  satisfactory  before  authorizing  payment. 


0.6  Operational  Problems 

The  introduction  of  a  computer  system  requires  an  extremely  methodological  approach  to  the  day-to-day  running 
of  the  library  or  information  service.  Although  the  computer  offers  advantages  in  terms  of  power,  speed  etc.  over 
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manually  maintained  files,  it  also  demands  a  very  systematic  approach.  In  order  to  achieve  this,  the  use  or  a  manual 
of  procedures  is  recommended.  This  manual  should  conta!n  a  checklist  of  the  way  a  particular  task  is  to  be  caried  out 

A  related  problem  is  that  of  systems  maintenance.  This  involves  the  corrections  of  errors  in  the  system,  imple¬ 
mentation  of  new  versions  and  other  changes.  The  amount  of  effort  needed  for  this  aspect  of  the  system  is  often  under¬ 
estimated. 

If  a  computer  is  used  outside  the  library  or  information  services,  a  procedure  manual  for  the  computer  operators  is 
essential  to  ensure  that  the  regular  tasks  arc  carried  out  on  schedule.  Manuals  should  cover  every  aspect  of  the  service, 
such  as  security  procedures  for  dealing  with  classified  information  on  magnetic  tape  and  instructions  for  running 
computer  programs  etc. 

A  data-base  administrator  should  be  appointed.  His  function  will  be  to  arrange  the  safety  and  availability  of  data  at 
all  times  and  to  ensure  that  data  files  are  not  corrupted  (damaged). 

The  assignment  of  individual  job  titles  and  descriptions  is  also  valuable  so  that  each  person  has  a  clear  understanding 
of  his  responsibilities. 


7.  CHEMICAL  STRUCTURE  INFORMATION  SYSTEMS  -  A  BRIEF  INTRODUCTION 

Tins  chapter  contains  a  brief  introduction  to  input  and  storage  of  chemical  structure  information  111  is  subject  is 
one  that  has  been  comprehensively  dealt  with  in  the  literature  and  several  texts34’35*36  are  available. 

The  major  ways  of  representing  a  chemical  structure  are  the  nomenclature,  the  molecular  formula  and  the  structure 
diagram.  The  problem  of  inputting  the  first  two  of  these  is  no  different  from  that  of  other  information  in  character  form 
Retrieval  of  the  desired  chemical  substance  can  be  achieved  using  molecular  formulae  and  nomenclature  indexes,  which 
are  produced  by  computer  using  this  information.  The  indexes  to  Chemical  Abstracts  show  the  variety  of  indexes  which 
can  be  produced. 

In  order  to  store  a  structure  diagram  in  the  computer  it  is  usually  necessary  to  transfer  the  structure  into  symbols 
which  can  be  directly  input  to  the  machine.  The  first  problem  is  to  transfer  the  two  or  three  dimentional  chemical 
structure  diagram  into  a  set  of  characters  which  can  be  manipulated  by  the  computer.  There  are  two  basic  approaches 
to  storing  information  on  chemical  structures  by  means  of  a  string  of  symbols. 

The  first  is  a  topological  representation,  often  known  as  a  connection  table  or  connectivity  matrix  which  describes 
the  atoms  and  their  interconnections  (bonds).  Each  type  of  atom  and  bond  of  a  structure  is  given  a  symbol  and 
individual  atoms  are  given  a  number.  A  table  of  atom  types  and  bond  connections  can  be  built  up,  which  can  then  be 
stored  in  the  computer.  This  sys  em  was  adopted  by  Chemical  Abstracts  Service  (CAS)  for  its  chemical  registry  system 

The  second  approach  is  based  on  the  use  of  linear  notation.  The  best  known  of  these  is  that  devised  by 
W.J.Wiswesser  m  the  early  1950’s,  It  comprises  a  string  of  alphanumeric  characters  which  are  used  to  describe  a  structure 
Symbols  are  assigned  to  individual  atoms  or  groups  of  atoms  and  bonds.  For  example,  ai/lic  atoms  are  given  a  single 
letter  e.g.  Fluonnc  ‘F\  Chlorine  'G\  A  benzene  ring  is  represented  by  'R\  A  fuller  description  can  be  found  in  Ash  and 
Hyde3S. 

Input  of  chemical  structure  information  can  be  undertaken  by  entering  the  string  of  symbols  corresponding  to  the 
structure.  Linear  notations  are  usually  shorter  than  connection  tables  and  hence  can  be  encoded  and  input  more  quickly 
On  the  other  hand  the  connection  table  can  be  constructed  by  someone  who  is  not  a  chemist,  whereas  the  notation 
demands  a  knowledge  of  chemistry. 

Alternative  methods  of  input  do  exist.  Zamora  and  Dayton’1  describe  the  use  of  a  chemical  typewriter  for  input 
of  structure  data  at  Chemical  Abstracts  Service.  The  system  comprises  a  Varian  620i  minicomputer  together  with 
twelve  chemical  typewriters  which  are  specially  modified  IBM  735  Selcctric  typewriters  with  reverse  index  and  a  special 
typing  element.  Special  characters  such  as  /  9  /  \  arc  included.  CAS  also  have  interactive  graphics  devices  for  input 
Another  graphic  method  is  to  use  optical  scanning  of  structures  drawn  on  a  special  grid. 

Chemical  structure  information  can  be  subjected  to  a  number  of  error  checks  before  being  stored  Examples  of 
checks  are: 

-  use  of  correct  symbols, 

-  comparison  with  molecular  formula  (input  separately), 

-  validity  of  syntax. 

An  important  feature  of  a  chemical  structure  information  system  is  the  process  of  registration  in  which  each  unique 
structure  is  given  a  unique  number.  The  difficult  problem  is  to  „void  registering  duplicates.  CAS  uses  a  computer  program 
to  create  a  unique  connection  table  from  the  one  input  which  is  then  compared  with  all  those  already  registered 
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Chemical  structure  search  packages  are  available.  One  example  i?  the  CROSSBOW  sys.em  of  Imperial  Chemical 
Industries  UK,  which  uses  both  Wiswesser  Line  notations  and  connection  tables  for  different  functions.  Compounds 
having  combinations  of  desired  fragments  orsubstructures  can  be  retrieved  from  the  computer  and  the  respective 
structure  diagrams  can  be  displayed. 
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APPENDIX  A 

AN  EXAMPLE  OF  ON-LINE  DATA  ENTRY  AT  THE  LIBRARY  OF  THE 
INTERNATIONAL  DEVELOPMENT  RESEARCH  CENTRE  (CANADA) 


The  following  dialogue  shows  how  a  record  of  an  item  is  to  be  ordered,  catalogued,  and  entered  into  a  data  base,  via 
an  on-line  terminal  When  the  item  subsequently  arrives  in  the  library,  tho  cataloguers  further  process  the  information  to 
produce  an  expanded  bibliographic  description.  Finally  an  abstract  is  prepared. 


VALID  FUNCTIONS  ARE* 
ENTRY  A 

INDEX 

LIBTFORMAT 
MODIFY 
PRINT 
QUERY 
EXIT 


options  for  interacting  with 
the  data  ba30. 

'entry'  is  selected  for  a  new 
item  to  be  input. 


WHICH  FUNCTION  DO  YOU  WANT?  entry  < —  item  to  be  ordered, 
PROGRAM  LOADED  WITH  LID  -  P 

ENTRY  A* 01* 04  LOGON  MON?  JUL  10?  1978?  3119  PM 

PLEASE  ENTER  DATA  BASE  NAME  OR  EXIT 


?  , process 

PLEASE  SELECT  BIBLIOGRAPHIC  LEVEL  ?  3 
ISN  -  29008 

PERSONAL  AUTHOR-ANALYTIC 
?  Daniels?  W .  C , 

PERSONAL  AUTHOR-ANALYTIC 
?  Mae Cor mac?  C  ♦  W  * 

PERSONAL  AUTHOR-ANALYTIC 
?  Mulse  ?  J  ♦  H  ♦ 

PERSONAL  AUTHOR-ANALYTIC 
*? 

CORPORATE  AUTHOR-ANALYTIC 
CORPORATE  AUTHOR  NAME  -  ANALYTIC 
?  000711 


CORPORATE  AUTHOR  CODE-ANALYTIC 
•?  000711 

CORPORATE  AUTHOR  PART-ANALYTIC 
?  Agriculture?  Food  and  Nutrition 
MORE  OUuFlELDS ( Y/N )  ?  n 
A  FK I L I A  f  I  tJN-ANAL  YT  JC  C 
A!- FILIATION  CODE-ANALYTIC 


Sciences 


? 

AFF I L I A T I  ON  PART-ANAL YT I C 


Division 


TITLE-ANALYTIC 

?  Collaboration  in  os ri cultural  research 
TITLE-ANALYTIC 


PAGE  NOS* -ANAL Y  f  TC 
?  n.  9- IS 

PERSONAL  AUTMOR-MONOGRAPi  I 


at  this  point  tho  system  ohook3 
to  soo  if  the  title  is  alroady  • 
in  the  data  base  (ohook  for 
duplicates) 


CORPORATE  AUTHOR-MONOGRAPH 
CORPORATE  AUTHOR  NAME-MONOGRAPH 
•? 

CORPORATE  AUTHOR  CODE-NONOGRAPH 


MIS  PAGE  IS  BSSr  qtULI&Y  rRAflriQftfrr.tt 
TROM  (Kin’  f»U.U»ltiHSD  10  0DC 
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9 

CORPORATE  AUTHOR  PART-MONOGRAPH 
v 

MORE  SUBFIELBO(Y/N>  ? 
AFFILIATION-MONOGRAPH 
AFFILIATION  CODE-MONOGRAPH 
? 

AFFILIATION  PART-MONOGRAPH 
? 

TITLE-MONOGRAPH 

v 

COLLATION-MONOGRAPH 

•? 

VENDOR  CODE 

NON-VENDOR 

STATUS  CODE 
? 

STATUS  CODE  DATE  <DD/MM/YY ) 

?  ++ 

ESTIMATED  PRICE 
v 

ORDER  DATE  (DD/MM/YY) 

? 

MATERIAL  DUE  DATE  (DD/MM/YY) 

9 

DATE  RECEIVED  (DD/MM/YY) 

? 

CITATION  SOURCE 

9 

REQUESTED  BY} 

REQUESTOR  NAME 
? 

REQUESTOR'S  DIVISION 
*? 

MORE  SUBFIELDS (Y/N)  ? 

NUMBER  OF  COPIES r  FORMAT 
? 

ALLOCATION  AND  COMMITMENT  NUMBER 
v 

ACCOUNT  NUMBER 
v 

.ACQUISITIONS  NOTES 
? 

DIRE}  ADDRESS  (5  LINES) 

DIRE— NAME  LINE  1 
? 

DIRE— NAME  LINE  2 

9 

DIRE— ADDRESS  LINE  1 

9 

DIRE— ADDRESS  LINE  2 
*? 

DIRE— ADDRESS  LINE  3 
? 

LANGUAGES  OF  TEXT 
?  ENGL 
CONFERENCE 
CONFERENCE  NAME 
? 


h  $ 
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CONFERENCE  PLACE 
? 

CONFERENCE  DATE 
? 

CONFERENCE  COUNTRY 

MORE  SUBFIELDS (Y/N )  ? 
EDITION 


PUBLISHER 
PUBLISHER  NAME 
? 

PLACE 

? 

COUNTRY 

? 

ISBN 

? 

DATE  OF  IMPRINT/ISSUE 
?  Aud  ♦  1977 

LIBRARY  OF  CONGRESS  CARD  NUMBER 


CANABIANA  CONTROL  NUMBER 


At  this  stage  the  item  has  arrived  in  the  library  and  its  'status' 
is  changed  to  'inpro'  i.e.  in  process, 

VALID  FUNCTIONS  ARE t 
ENTRY 
INDEX 

LISTFORMAT 

MODIFY 

PRINT 

QUERY 

FXIT 

modify  function  chosen  so 

WHICH  FUNCTION  DO  YOU  WANT?  modify ^  that  data  can  be  modified 
MODIFY  VERSION  A. 02, 00  LOGON  MON*  JUL  10*  1978*  3J31  PM 

PLEASE  SELECT  DATA  BASE  OR  EXIT 
Tbiblio 

PLEASE  ENTER  FUNCTION  TO  SELECT  RECORD 

?isn*29008 

ISN  =  29008 

PLEASE  SELECT  FUNCTION 
?1  all 

29008  STATUS:  inpro 

CHANGO:  CHANG I : 

LANTEX:  ENGL  DATECJ  DATE  I  ♦  Aud.  1977 

PAUTHA:  Daniels*  U  *  C  ♦  MacConriac*  C*W»  Hulse*  J,H* 

CORPA  t  0007110007ilAdriculture*  Food  and  Nutrition  Sciences  Divi  „on 

TITLEA:  Collaboration  in  agricultural  research 

PA6ENA:  p.  9-18 

FACET  t 

ABST  : 

PLEASE  SELECT  FUNCTION 
?tra  al74  to  3144 
CORPORATE  AUTHOR-ANALYTIC 

0007H000711Atlriculturo*  Food  and  Nutrition  Sciences  Division 

CORPORATE  AUTHOR  CODE-ANALYTIC 

000711 


**  FIELD  TRANSFERRED 
PLEASE  SELECT  FUNCTION 
Ytra  al75  to  a  145 
CORPORATE  AUTHOR-ANALYTIC 

0007llAsEr.iculture r  Food  and  Nutrition  Sciences  Division 
CORPORATE  AUTHOR  PART-ANALYTIC 

Agriculture ?  Food  and  Nutrition  Sciences  Division 

AFFILIATION-ANALYTIC 

0007.11 

«  FIELD  TRANSFERRED 
PLEASE  SELECT  FUNCTION 
PLEASE  SELECT  FUNCTION 
fdel  'si? i 

CORPORATE  AUTHOR-ANALYTIC 
000711 

CORPORATE  AUTHOR  NAME  -  ANALYTIC 
000711 

«  FIELD  CORPNA  DELETED 
PLEASE  SELECT  FUNCTION 
?1  a 170 

«S  FIELD  CORPA  IS  NOT  DEFINED 
PLEASE  SELECT  FUNCTION 

v  ^  317 

«  FIELD  CORPCA  IS  NOT  DEFINED 
PLEASE  SELECT  FUNCTION 
?1  a 140 

AFFILIATION-ANALYTIC 

000711Adriculture»  Food  and  Nutrition  Sciences  Division 
PLEASE  SELECT  FUNCTION 
?add  datec 
DATE  OF  CONTENT 
?  1977 

PLEASE  SELECT  FUNCTION 
?chs  status 
STATUS  CODE 
in  pro 

PLEASE  ENTER  CHANGE 
Tin pro %p roof 
PLEASE  ENTER  CHANGE 
'? 

proof 

Please  select  function 

?add  chan.de 
LAST  CHANGE 
LAST  CHANGE  DATE 
?++ 

LAST  CHANGE  INITIALS 
?cs 

PI. EASE  SELECT  FUNCTION 


Following  this  the  item  is  abstracted  etc 
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APPENDIX  B 

SOME  EXAMPLES  OF  INPUT  FORMATS,  FORMS,  HANDLING  OF 
SPECIAL  SYMBOLS  AND  ENTRIES  IN  A  DATA  DICTIONARY 


Figure  B 1  shows  an  input  form  used  for  the  NASA  STAR  data  base.  Much  of  the  information  is  indicated  by 
marking  the  appropriate  position  on  the  page.  Some  information  c.g.  publication  data,  is  in  fixed  field  format. 

Another  example  of  a  fixed  field  format  is  that  used  for  the  Research  and  Technology  Work  Unit  Information 
System  (WUIS)  at  the  US  Defense  Documentation  Center  (DDC). 

Figure  B2  shows  the  data  encoded  on  a  single  punched  card. 

Figure  B3  shows  some  of  the  rules  for  handling  special  symbols. 

Figure  B4  shows  an  example  taken  from  the  data  dictionary  (format  description),  showing  the  instructions  for 
encoding  a  title.  The  label,  (tag  or  field  number)  is  “1 1  ’,  the  name  is  ‘Title’  and  it  has  a  length  of  1 60  characters  which 
must  be  alphanumeric. 


'i-  ~-9~  *<  •  ■ 
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Fig.Bl  Data  collection  form  for  the  NASA  (STAR)  data  base 
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Fig.B2  Part  of  the  fixed  field  input  format  used  at  the  US  Defense  Documentation  Center 


Fig.B3  Some  of  the  rules  for  handling  special  symbols  at  the 
United  States  Defense  Documentation  Center 
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Field  Title  (160  Alpha/Numeric) 
11 

Instructions 


A  brief  descriptive  title  for  the  work  unit  is 
mandatory  at  all  times  in  a  WUIS  record.  An  entry  is 
mandatory  on  all  NEW  transactions.  The  field  may  not  be 
deleted  by  a  modification  transaction.  The  first 
character  of  the  title  field  in  the  card  or  tape  record 
is  a  single-character  code  for  the  security  classifica¬ 
tion  of  the  title  (either  U,  C,  S,  or  T)  When  preparing 
cards  or  card  images  do  not  leave  a  space  between  the 
code  and  the  first  character  of  the  title  itself.  (Items 
classified  Top  Secret  must  be  forwarded  to  NSA,  and  will 
not  be  accepted  by  DDC.) 

Edit/Audit 


a.  The  field  may  never  be  blank  or  deleted. 

b.  The  first  character  of  the  title  must  be  U,  C, 
or  S  for  entry  in  the  WUIS  by  DDC. 

c.  The  security  classification  of  the  title  must  be 
equal  to  or  less  than  the  Summary  Security,  Field  5. 

Disposition 

a.  A  transaction  is  not  releasable  if  Field  11  is 
blank  on  a  NEW  transaction  or  deleted  by  a  modification. 

b.  A  transaction  is  not  releasable  if  the  title 
classification  is  other  than  U,  G,  or  S  or  if  the  title 
classification  is  higher  than  Field  5. 

Fig.B4  A  description  of  the  data  element  'title’  in  the  US  Defense  Documentation  Center  data  base 

(an  item  from  a  data  dictionary) 
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APPENDIX  C 

PROCESSING  OF  BIBLIOGRAPHIC  INFORMATION  AT  THE 
CANADIAN  NATIONAL  DEFENCE  INFORMATION  CENTRE 


Figure  Cl  shows  the  title  page  from  a  report  to  be  included  in  the  system. 

Figure  C2  shows  the  print-out  from  the  computer  used  for  proof-reading  purposes. 

Figure  C 3  shows  a  temporary  catalogue  card  and  a  second  card  used  for  loan  control. 

The  examples  shown  in  Figure  C2  and  Figure  C3  are  produced  by  the  computer,  which  also  produces  the  library 
catalogue  on  microfilm. 
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Fig.Cl  The  title  page  from  a  report  which  is  to  be  catalogued 
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Section  5 

MECHANIZATION  SYSTEMS  AND  OPERATIONS 
by 

Victor  J. Rogers 
Technology  Reports  Centre 
Department  of  Industry 
Orpington,  Kent,  UK 


ABSTRACT 


This  Section  provides  a  working  basis  for  setting  up  a  computer  system  for  indexing, 
processing  and  disseminating  scientific  and  technical  information  mainly  in  the  form  of 
bibliographic  references  to  research  report  literature.  Following  brief  refere"ces  to 
alternative  forms  of  mechanisation,  such  as  the  tape  typewriter,  the  main  emphasis  is  upon 
the  in-house  computer  and  its  uses  for  data  capture,  text  handling,  good-quality  printing 
and  information  retrieval.  Methods  are  described  for  setting  up,  storing  and  exploiting 
databases,  and  the  procedures  for  creating  an  announcement  journal  are  defined  in  detail. 

The  practical  examples  which  are  used  to  illustrate  the  work  are  based  upon  experience 
gained  at  the  Technology  Reports  Centre  (TRC)  of  the  UK  Department  of  Industry  and  the 
Defence  Research  Information  Centre  (DRIC)  of  the  UK  Ministry  of  Defence. 
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1.  INTRODUCTION 

Research  and  development  m  the  realms  of  science  and  technology  give  rise  to  a  flood  of  reports  and  special  litera¬ 
ture  which  is  generally  created  at  considerable  cost.  Modern  techniques  for  dealing  with  this  documentation  in  ways 
which  will  benefit  the  w  idest  possible  range  of  users  rely  increasingly  upon  mechanical  and  electronic  aids,  and  it  is 
evident  that  the  computer  is  the  most  significant  element  in  systems  designed  for  processing  scientific  and  technical  infor¬ 
mation  (STI).  This  applies  particularly  to  the  means  used  for  providing  existing  and  potential  users  of  research  report 
literature  with  lists  of  the  holdings  of  information  centres  and  special  libraries,  and  it  is  true  to  say  that  efficient  and 
comprehensive  exploitation  of  modem  data  bases  would  be  quite  imprac.icable  without  the  computer. 

Computer  manufacturers  and  specialised  software  houses  offer  a  variety  of  systems  which  can  be  employed  in 
mechanised  information  handling  processes  so  that,  when  planning  the  computerisation  of  an  existing  manual  system,  it 
is  wise  to  consult  hardware  manufacturers  and  organisations  with  experience  of  writing  reievant  software  If  it  is  found 
practicable  to  purchase  a  commercial  software  package,  i.e.  a  ready-made  set  of  working  programs,  it  is  generally  necessary 
to  make  significant  additions  to  the  system  in  order  that  the  requirements  of  the  local  situation  will  be  fully  met  This 
section  of  the  Manual  describes  the  steps  to  be  taken  in  order  to  build  up  a  computer-based  system  designed  both  to 
handle  the  collection  and  processing  of  bibliographic  references  to  scientific  and  technical  report  literature,  and  also  to 
disseminate  the  information  mainly  in  the  form  of  announcement  journals  and  similar  accession  lists  The  recommended 
systems  are  capable  of  providing  information  retrieval  by  direct  access  from  remote  computer  terminals 

The  practical  examples  which  are  used  to  illustrate  the  work  are  based  upon  experience  gained  at  the  Technology 
Reports  Centre  (TRC)  of  the  UK  Department  of  Industry  and  the  Defence  Research  Information  Centre  (DR1C)  of  the 
UK  Ministry  of  Defence. 


2.  BASIC  MECHANISATION 

Many  of  the  mechanical  devices  used  in  the  creation  and  processing  of  research  reports,  announcement  journals  and 
similar  documents  can  be  described  as  office  machinery.  This  includes  addressing  and  enveloping  machines,  some  kinds 
of  printing  machines,  special  cabinets  for  storing  and  displaying  index  cards  and  printed  indexes,  and  photographic 
devices  for  creating  pages  of  text  for  use  as  printing  masters  (plate-makers).  Such  equipment  can  be  adapted  to  reduce, 
for  instance,  the  tedium  of  information  retrieval  from  files  of  index  cards.  Some  of  these  card-based  devices  involve  the 
provision  of  coloured  tags  or  notched  edges  to  indicate  broad  subject  groups,  and  other  more  sophisticated  processes 
rely  upon  the  coincidence  of  holes  drilled,  with  considerable  precision,  in  special  cards.  The  computer  can  handle  this 
kind  of  basic  information  m  a  much  more  versatile  way,  and  rapid  developments  have  taken  place  in  the  field  since  the 
first  files  of  bibliographic  information  were  set  up  in  the  computer  by  reading  in  data  from  punched  cards  and  punched 
paper  tape. 


2.1  Tape  Typewriters 

•It  was  the  tape  typewriter  which  pointed  the  way  towards  the  computer-based  scientific  and  technical  information 
(STI)  system  and  operations  of  this  kind  are  described  in  detail  in  Section  4  of  this  Manual.  The  simultaneous  creation 
of  a  set  of  data  in  the  form  of  text  (‘hardcopy’)  and  as  machine-readable  codes  punched  onto  paper  tape,  not  only 
eliminated  repetitive  typing  but  provided  a  record  which  was  acceptable  to  the  computer.  Control  codes  related  to  a 
second  punch,  for  instance,  can  be  used  to  create  a  special  paper  tape  containing  selected  fields  which,  when  sorted  by 
means  of  standard  software,  provide  the  basis  of  effective  and  economical  printed  indexes.  The  judicious  selection  of 
a  clear  type-face  and  a  well-designed  format  control  program  loaded  into  an  auxiliary  reader  can  ensure  a  satisfactory  page 
layout  for  tae  mam  text  of  an  announcement  journal  (see  Figure  1),  and  ind'-x  pages  created  automatically  and  printed 
by  the  computer  have  proved  acceptable  even  when  photo-reduced  to  conform  to  the  required  page  size  The  implica¬ 
tions  of  the  photo-reduction  of  such  pages  are  dealt  with  in  detail  in  paragraph  4.18.2,  but  it  can  be  noted  here  that, 
where  facilities  exist  for  the  photo-reduction  of  text  according  to  a  variable  scale,  a  two-column  (‘two-up’)  page  format 
can  readily  be  achieved  by  the  appropriate  an.mgement  of  the  final  text  output  from  the  typewriter  at  the  camera 
copying  stage.  Useful  title  strips  can  also  be  added  at  this  stage  to  provide  extra  subject  headings  which  will  link  the  text 
with  the  index  (see  Figure  2>. 


2.2  Computerised  Indexes  via  the  Tape  Typewriter 


“Pei  issue"  indexes  of  the  kind  referred  to  above  are  generally  printed  from  transient  files  of  information  held  in  the 
computer,  that  is,  files  which  will  be  overwritten  by  the  next  increment  of  bibliographic  data  Magnetic  tape  storage  is 
relatively  inexpensive,  howeves,  and  provided  that  the  computer  has  sufficient  sorting  capacity,  the  tapes  can 
subsequently  be  used  very  economically  to  produce  cumulated  semi-annual,  annual  or  volume  indexes  by  running  the 
contents  of  several  of  them  together.  t  licse  indexes  can  be  photo-reduced  to  a  convenient  page  size  (generally  A4  in  the 
UK)  and  bound  to  form  a  separate  publication  which  is  useful  tor  information  retrieval  purposes  since  it  eliminates  the 
need  for  m<.nual  searching  through  incremental  indexes.  A  page  from  a  typical  index  is  shown  in  Figure  3 
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21  MATERIALS 


21-2  CERAMICS.  REFRACTORIES  AND  GLASSES. 


[1083-6911]  AD-643299  UNLIMITED 

American  Optical  Co.  ,Southbridge,M?.ss,  ,USA 
PREPARATION  OF  PLATINUM- FREE  LASER  GLASS  1.1-31.7.1966 
Woodcock,  R.F.  Granltsas,  G„A„  12  1966  1 5PP 

N0NR-4636-00  SAR-4 
UDC.  666. 1 , 03/666-22/621 . 373. 826 

The  purpose  of  this  project  was  to  produce  high  optical 
quality  laser  glass  free  of  platinum  particles,  Methods  to 
prevent  evaporation  of  various  oxides  that  caused  inhomo- 
geneities  in  the  glass  were  investigated.  Data  was  accumulated 
to  determine  the  effect  of  time,  temperature,  and  atmosphere 
on  the  weight  loss  of  platinum  and  alloying  of  platinum  with 
ingredients  m  glass.  TB 

21-6  METALLURGY  AND  METALLOGRAPHY 

mmmrnmm  m  — —  wnaww  ■  —  - 

[1084-6911]  P-198379  TT-1346  UNLIMITED 

Arch. ElsenhuttWes. , Vol. 37, No. 7, 1966, 949- 930, Germany 
EQUILIBRIA  OF  THE  I RON- CARBON- OXYGEN  SYSTEM  IN  THE  TEMPERATURE 
AND  CONCENTRATION  RANGE  OF  MOLTEN  STEEL  AND  THE  MANNER  IN 
WHICH  THESE  ARE  INFLUENCED  BY  PHOSPHORUS.  MANGANESE  AND  SULPHUR 
DIE  C-LEI CliGEWI CHTE  DES  SYSTEMS  EISEN-KOHLENSTOFF-SAUERSTOFFIM 
TEMPERATURE-  UND  K0N2ENTRATI 0NSBERE1 CH  DES  FLUSS1GEN  STAHLS  UND 
IHRE  BEEINFLUSSUNG  DURCH  PHOSPHOR,  MANGAN  UND  SCHWEFEL 
Schenck.H.  Hlnze.H.  1963  14PP  14ref 
UDC ,  660, 01 7, 1 3/66°. l 3' 784' 787/669 . 779/669.74/669.779 

The  reactions  cf  carbon  monoxide  and  carbon  dioxide  mixtures 
with  molten  iron  belong  to  the  essential  reactions  for  which 
it  Is  desirable  to  obtain  equilibrium  values  of  an  accuracy 
as  high  as  possible.  New  data  were  obtained  observing  all 
possible  precautions  to  obtain  the  desired  accuracy.  These 
data  included  the  influence  of  the  alloying  elements  phosphorus, 
manganese  and  sulphur  GMH 


fig.  1  Single-column  page  of  text  fiom  abstract  journal 
( tape-typewriter)  TRC  1969 


2.3  Computer-based  Data  Stores 

The  storage  of  textual  information  in  machine-readable  (digit.d)  form  is  becoming  less  expensive  as  techniques  are 
developed  for  the  economical  use  of  large  disc  stores.  Although  the  computer-held  files  used  specifically  for  the  produc¬ 
tion  of  printed  indexes  to  announcement  journals  are  transient  the  original  files  coni  lining  complete  bibliographic 
references  become  a  valuable  resource  when  held  in  suitable  computer  stores,  and  most  of  the  data  bases  in  common  use 
began  in  ibis  way. 


3.  THE  DATA  BASK 

The  computer  held  data  base  of  bibliographic  information  is  essentially  a  large  dyaimiK  file,  made  dynamic  by  the 
frequent  addition  of  new  data,  and  in  some  cases,  by  the  removal  o»  ledundant  mater, al.  I  igure  4  provides  an  indication 
of  the  way  in  which  the  file  is  buill  up,  and  the  Data  Conversion  stage  is  uishided  to  emphasize  the  necessity  for  adopting 
a  standard  data  format  When  a  set  of  elements  of  bibliographic  information  is  put  together  to  form  a  record,  or  item,  in 
a  data  base  file,  each  element  becomes  a  field  in  that  record  and  must  he  input  according  to  strict  rules  in  order  to 
conform  to  the  standard  format  Data  format  rules  are  referred  to  in  paragraphs  4.0  and  4.7.  and  a  typical  data  base  file 
format  is  similar  to  the  main  bibliographic  record  shown  m  f  igure  5.  Once  a  data  base  format  has  been  established  il 
can  be  very  costly  to  change  it  m  order  to  include  additional  elements,  but  conversion  fiom  one  format,  containing  the 
basic  bibliographic  data  fields  in  a  specific  sequence,  to  another  in  which  the  same  elements  appear  in  a  somewhat 
different  form  or  different  order  (eg  the  standard  data  base  format  adopted  for  the  system),  is  relatively  straighttorwaul 
in  a  computer  context  lints  a  single  data  base  can  be  made  up  ol  inlonnation  derived  in  machine-readable  torm  trout 
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14  METHODS  AND  EQUIPMENT 

C129&-2703] 

173*01367  AERa-H-2566 

itaalc  Energy  Res. But. .Harwell, UK 
METHODS  OP  SPECIMEN  PREPARATION  POR  TRANSMISSION  ELECTRON 
MICROSCOPY 

Harper .B.A.  10.1972  22PP  llref 

Availability:  HMSO  £0.50 

A  range  of  naterlals  have  been  txanlnod  lr.  tins  labai'ator;' 
resulting  In  the  development  of  tcchr.iquoc  for  tr.t  preparation 
of  thin  filas  for  transalsslon  eloctron  aloroooopy.  Apparatus 
for  obtaining  thin  areas  In  ceraalc  naterlals,  by  grinding, 
electro-polishing  nethods,  and  lor>-boaa  thinning  are  described. 
Suspension  nethods  for  sub-nlcron  powders,  rolling  techniques 
for  use  on  pure  netals,  and  the  use  of  the  ultia-nlcrotone  on 
biological  naterial  are  brleriy  described. 

Indexing  Terns:  ‘Electron  nloroeoopy/*Speolnenslblology  j/ 

Thin  fllns 


11299-27031 

773-01411 

Netlonal  Physical  Lab.  .Toddlngton.Nddx.  ,UK 
ASPECTS  OP  HOPE  ON  THE  PRIMARY  STANDARDS  OP  MEASUREMENT 
Dunworth.J.V.  Doan.P.  1972  I7PP  5ref 
Availability:  TRC  £0.50 

Discusses:  tine  and  frequoncyl  length!  electrical  standards! 
tcoperature!  nass!  fundamental  constants.  lIn:Proc.RS£(A)  ,Vol. 

70.No. 12, 1971/72) 

Indexing  Terns:  ‘Metrology /‘Standards/Physlcs/Ihysics 
laboratories 


1 1300- 2703) 

T7  3-01414 

National  Physloal  Lab.  .Teddlngton.Kddx.  ,UK 
TEN SI  ON- COM  Pit  ESSI  ON  STAGE  POR  A  SCANNING  ELECTRON-MI  ChOSCOPE 
Electron  Mloroscopy,5th  European  Congress, 1972 
Thcmw.lt.  Broalnrell.P.R.  1972  2PP 
Availability:  TRC  £0.50 

During  the  djnanlc  obsorvatlcns  of  apsolnen  na  o-  lals  undor 
strain,  the  Jaws  holding  the  speolnens  apply  atraln  eoually 

Two-single-column  pages  were  laid  side-by-side  under  the  camera  and  photo-reduced 
to  75%  of  the  original  size. 

Heading  and  sub-heading  strips  were  added  as  required  before  each  page  was 
photographed. 

Fig.2  Double-column  page  of  text  from  abstract  journal 
[tape-typewriter)  -TRC  1973 


a  variety  of  sources.  Attempts  continue  to  be  made  to  encourage  data  base  producers,  particularly  those  working  in  the 
fields  of  science  and  technology,  to  adopt  a  standard  format  at  source,  but  experience  indicates  that  such  principles  tend 
to  become  ponderous  and  inflexible  in  practice.  The  MARC  (MAJimc  Readable  Cataloguing)  system  was  developed  by 
the  US  Library  of  Congress  and  the  British  Library  to  provide  for  the  international  exchange  of  bibliographic  information 
and  this  is  probably  the  most  notable  example  of  a  standard  data  format. 

Although  large  disc  stores  can  provide  direct  access  to  vast  quantities  of  bibliographic  information  when  held  on¬ 
line,  it  is  usual  for  the  latest  accessions  to  be  in  the  greatest  demand  by  information  users.  This  leads  to  the  concept  of 
“levels  of  storage"  which  can  be  applied  to  eilect  useful  economies.  The  latest  information  can  be  held  on  disc,  with  the 
older  information  which  is  less  in  demand  being  available  on  magnetic  tape  ready  for  transfer  to  disc  in  response  to  a 
specific  request.  Users  of  such  systems  arc  expected  to  define  a  range  of  dates  when  requesting  information,  and  any 
charges  made  for  transferring  data  from  tape  to  disc  and  back  again  to  tape  are  reflected  in  the  overall  costings. 

3.1  Storage  of  Data  Within  the  Computer  (see  Glossary  of  Terms) 

When  setting  out  to  define  the  optimum  size  of  an  individual  record  within  a  disc  file,  it  iv  essential  to  take  account 
of  the  basic  hardware  design.  An  1CL  1 900-series  computer,  for  instance,  uses  a  24-bit  ‘word-,  a  1 28  word  ‘block’,  and 
‘buckets’  containing  1 , 2  4  or  8  blocks  of  data,  and  to  achieve  economy  in  the  use  of  storage  space  within  the  machine  or 
its  peripherals  it  is  advisable  to  set  up  files  of  such  a  size  that  they  fit  closely  within  these  boundaries.  Techniques  have 
been  developed  for  compressing  the  data  in  digital  form  within  individual  records  to  avoid  waste  of  space,  but  this 
procedure  is  not  so  effective  with  text  as  it  is  witli  files  of  numeric  data. 
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14-2  Laboratories,  Teat  Facilities  and  Test  Equipment 

(1295-2703) 

T7 3-01053  ABES- BIB-181 

it  calc  Energy  Ros.Eat.  .Harwell, UK 
LIQUID  CRYSTALS  POR  NONDESTRUCTIVE  TESTIN0 
Wall,M.A.  10.1972  llpp  78ref 
Availability:  HMSO  £0.50 

ttiie  bibliography  has  been  compiled  fraa  the  document 
collection  of  the  Nondestructive  Testing  Centre  Information 
and  Advisory  Service  and  covers  the  period  1963  -  1971.  It 
includes  papers  on  liquid  crystals  and  their  application,  par¬ 
ticularly  in  the  field  of  nondestructive  testing.  References 
are  listed  in  chronological  order  and  are  followed  by  an  author 
index. 

Indexing  Terms:  *Xiquld  orystals/*Mcndestruotivo  tests/ 
Bibliographies 


1 1 296- 2703 1 

17>01159  APCRL-72-0278 

Air  ?orce  Cambridge  Res. Labs .Hansocn  Fie Id, Has 6. .USA 
BARTH  MOTIONS  AND  THEIR  EFFECTS  ON  INERTIAL  I  NSW  WENT 
PERFORMANCE 

Gray.R.A.  Cabanlss.Q.H.  4.1972  6 3pp  57ref 

Availability:  TRC  £l.20  AP-SURV-0E0PHY3-239 

The  purpose  of  this  paper  is  to  delineate  and  explicate  the 
effeots  of  earth  notions  on  Inertial  Instrument  performance* 
to  oxanine  nethods  for  reducing  thoco  errors  induood  by  the 
earth  notion  environment ,  and  to  enphaslzo  the  need  fer 
expanded  research  along  these  linos. 

Indexing  Terns:  *Barth  novementsAlnertial  guidanoe/lnertial 
navlgatlon/Oyroucopos/fearth  cruct/tnstmaont  oharaoteristlos/ 
Environmental  notions/Znstruaent  performanoe/Aoouraoy/Pcrforaed 
standards/Earth  rotation 
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i  SUBJECT  INDEX  5 


SUBJECT  mplNQt  TITLE 

TRC  ACCESSION  NO. 

location 

COMPUTER  PROSCANS 

CONCEPTS  IN  PROGRAM  TESTING 
SOFTWARE  FOR  ADVISORY  CONTROL  Of 
STRESS,  STABILITY#  ANl*  VIBRATION 

TNI  NATIONAL  TRANSMISSION  SYSTEM 
os  complex  branched  shells  of  revolution. 

analys 

T73-01873* 

U3-01S5V* 

m-oi2»z 

1182-2703 
1 1 78-2705 
1391-2703 

COMPUTER  SYSTEMS  PROGRAMS 

F ACTORf 1NDER  11  VERIFIED  SOFTWARE  RR0DUCTS.  A  CATALOGUE 

WORKLOAD  MODEL  AND  MEASURES  FOR  COMPUTER  PERFORMANCE  EVALUATION 

T73-Q1734* 

T7J*0tm 

1176-2703 

1183-2705 

COMPUTERIZED  SIMULATION 

MULT INETHOD-MULT I TRAIT  VALIDATION 

of  a  digital  simulation  model 

T73-00591 

1051-2703 

COHPUTEkS 

APPLICATION  OF  AN  ON-LINE  COMPUTER  TO  PHYSICS  EXPERIMENTS  IN  2EDR' 
COMPUTER  COURSES  -  POST  GRADUATE  AND  FIRST  DEGREE 
COMPUTER  COURSES  -  GOAL  I  FT 1NG 

COMPUTERS  IN  OFFICES  m2,  MANPOWIR  STUDIES  NO  '.2 

insuring  a  computer  ststen 

MINICOMPUTER  APPLICATION  (APRIL  IDSP-JUIY  ID71) 

OPERATING  THE  RADIO  DEPARTMENT  REMOTE  COMPUTER  TERMINAL  EOUIPNENT 

PAYE  AND  COMPUTERS.  COMPUTERGUIDE  9 

VALUE-ADDED  TAX,  SOON  1 

VALUE-ADDED  TAX,  DOOR  2 

VALUE-ADDED  TAX,  8(iOA  5 

CONCRETES 


USfc  OF  ACCELFKATED  STRENGTH  TESTS  FOR  THE  SJTE  CONTROL  Of  CONCRETE 

T73-01 59  / 

1 193-2703 

CONICAL  BODIES 

HEAT  TRANSFER  AND  SURFACE  PRESSURE  MEASUREMENTS  Ow  TWO  CONICAL  WINGS  IN  FREE 

FLIGH 

T73-00249 

1347-2703 

consoles 

DEVELOPMENT  CF  GRAPHIC  AREA  DISPLAYS  FOR  ASW  ATTACK  MANAGEMENT  SIMULATION  VUl 

1  (1 

T73-01168 

1316-2703 

CONSTRAINING 

RESTRAINED  COLUMN  DESIGN 

T73-01S83 

1289-2703 

CONSTRUCTION 

LIST  OF  PUBLICATIONS  bY  STAFF  OF  STRUCTURAL  ENGINEERING  DIVISION  Of  BUILDING 
STRUCTURAL  PERFORMANCE  EVALUATION  OF  INNOVATIVE  BUILDING  SYSTEMS 

RESEA 

T73-01532 
T73-01 511 

1287-2703 

1285-2703 

contamination 

CONTAMINATION  PROBLEMS  WITH  SPACECRAFT  (IN  GERMAN) 

T73-01 687 

1422-2703 

T73-01D34 

1324-2703 

T73-01 871  * 

1180-2703 

T73-01 872* 

1181-2703 

T73-01 81 3* 

1177-2703 

T73-U1878* 

1183-270 3 

T73-01133 

1173-2703 

T73-00566 

1169-2703 

T73-01800* 

1184-2703 

T73-01887* 

1046-2703 

T73-Q1 888* 

1047-2703 

T73-0188V» 

1048-2/03 

The  indexes  to  each  issue  of  the  abstract  journal  were  printed  in  upper  case  on  an  1CL 
1900-series  computer  and  photo-reduced  to  about  60rf  of  the  original  size  to  fit 
conveniently  on  the  page. 

The  input  to  the  computer  was  the  punched  paper  tape  created  by  the  tape-typewriter 
as  the  main  text  of  the  journal  was  being  typed. 


Fig.3  lixtract  from  subject  index  TRC  1973 


Ideally,  the  abstract  of  each  report  held  in  the  data  base  should  form  part  of  the  record,  but  this  can  increase  the 
record  size  from,  say  400  characters  to  more  than  1500  characters  for  the  normal  item  of  bibliographic  information  held 
in  an  S 1 1  system.  Thus  it  is  considerations  of  field  size,  record  size,  and  file  size  which  exercise  a  fundamental  influence 
upon  the  practical  ami  financial  feasibility  of  any  data  base. 

It  has  been  pointed  out  that  ST1  data  bases  are  mostly  built  up  of  regular  increments  of  data.  This  is  quite  a 
convenient  arrangement,  since  it  keeps  the  latest  information,  whkh  is  generally  most  in  demand,  in  the  most  accessible 
position  in  the  total  file.  Holding  a  data  base  in  chronological  order  calls  for  reliable  machine  held  indexes  to  facilitate 
access  by  subject,  author,  report  number,  etc.,  and  this  makes  the  ‘vveeding-out’  of  individual  report  references  a  very 
complex  matter. 

3.2  The  Computerised  Thesaurus 

Computerised  information  retrieval  systems  based  upon  a  dictionary  of  approved  indexing  terms  (descriptors) 
require  a  machine-readable  thesaurus  to  facilitate  subject  searches,  Figure  (>  shows  an  extract  from  the  IJ(.\  Thesaurus 
ot  hiigmeermg  &  Scientific  lerms  t  IT  SI ).  and  unlike  the  data  base  itself,  the  thesaurus  file  requires  periodic  reorganisa 
••oil.  with  terms  of  various  kinds  being  added,  amended  or  deleted  It  is  convenient  to  control  these  activities  by  holding 
the  master  file  on  punched  chards.  and  1  igure  ’  indicates  the  procedure  lor  updating  such  a  file  with  software  designed 
to  ensure  that  additional  terms  are  posted  correctly  ,  i.e.  are  added  not  only  a-  new  main  terms,  but  also  as  broader  or 
narrower  terms,  suitably  annotated,  under  their  relevant  mam  term  headings  Considerable  intellectual  intervention  is 
required  to  ensure  that  each  new  term  is  fully  integrated  into  the  thesaurus  and  a  sy  stem  of  codes,  to  be  punched  into  the 
cards  m  addition  to  the  text,  can  be  used  to  enable  the  update  program  to  make  the  amendment.,  a,  required  The 
handling  ol  alterations  to  and  deletions  from  the  terms  in  the  file  is  generally  simpler  than  making  additions,  but  prec  im* 
planning  is  required  to  keep  the  thesaurus  file  up-to-date,  and  a  complete  update  run  is  necessary  at  regular  intervals  when 
a  check  is  also  made  to  ensure  that  the  latest  set  of  interpreted  punched  cards  is  correct  On  line  updating  from  a  V1HT 
terminal  is  feasible,  but  it  is  generally  less  reliable  because  of  the  greater  possibility  of  human  error 
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Project  Number 
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Availability  and  Price 
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Further  Reference  Number 
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Abstract 
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Indexing  Terms 

4000 


Notes: 

1.  Maximum  record  length  —  4000  characters 
Average  record  length  =  1 500  characters 

2.  Each  field  size  includes  the  end-of-field  marker. 

3.  The  field  size  quoted  for  Field  20  includes  all  sub-field  markers  and  the 
end-of-record  marker. 

4.  If  the  document  is  a  translation  the  title  field  (Field  7)  will  end  with  the 
identity  of  the  country  of  origin  abbreviated  as  necessary  and  held 
between  brackets  preceded  by  a  sub-field  marker. 

Fig. 5  Main  bibliographic  record  (input  file  structure) 


4.  THE  IN-HOUSE  COMPUTER 

4.1  Initial  Preparations 

4. 1. 1  The  Feasibility  Study  is  the  essential  pre-requisite  to  any  computer-based  STI  system.  Depending  upon  the  size 
of  the  installation  and  the  time  available,  tne  study  should  be  carried  out  by  an  individual  ora  small  team  with  experience 
of  the  organisation  and  management  of  office  procedures  but  with  a  greater  knowledge  of  the  implications  of.  and 
opportunities  for,  the  automatic  processing  of  textual  data.  Consultants  from  outside  the  organisation  affected  should  be 
expected  to  confer  with  management  at  frequent  intervals  dung  the  study  to  ensure  that  the  plans  develop  in  the  right 
direction.  I>  cases  where  the  computer  is  being  installed  to  take  over  or  extend  operations  already  being  performed  by 
manual  methods  or  by  some  form  of  basic  mechanisation  every  step  within  each  activity  must  be  precisely  documented 
and  fully  unuerstood  both  as  a  separate  function  and  also  as  it  affects  other  operations  within  the  total  system.  Careful 

a  Italy's  of  this  kind  can  show  up  ways  in  which  current  procedures  may  be  improved,  for  it  is  often  found  that  activities 
which  are  unnecessary,  uneconomical  or  redundant  have  been  allowed  to  become  an  accepted  part  of  »hc  system.  Thus 
detailed  systems  analysis  as  a  prelude  to  computerisation  can  prove  useful  even  if  the  actual  introduction  of  the  computer 
is  delayed. 

4. 1.2  Special  Accommodation  Careful  planning  is  needed  to  ensure  that  the  special  accommodation  required  to  house 
the  computer  and  its  ancillary  equipment  is  ready  on  time,  A  frequent  source  of  delay  is  the  creation  of  the  controlled 
environment  needed  by  the  larger  machines  (generally  those  with  more  than  16  000  bytes  of  core),  although  many  of  the 
mini-computers  referred  to  in  Chapter  five  will  operate  satisfactorily  in  the  kind  of  air-conditioning  which  is  found  in 
modem  office  accommodation.  In  most  cases  it  will  be  necessary  to  build  some  special  accommodation  or  to  carry  out 
modifications  to  an  existing  building,  and  it  is  wise  to  start  on  this  part  of  the  overall  task  at  the  earliest  possible  time 
and  to  monitor  progress  most  carefully.  As  well  as  the  machine  itself  there  must  be  accommodation  for  computer 
operators,  progranintv.s  and  data  preparation  staff,  all  of  whom  must  be  recruited  and  trained  in  good  time.  The  cost 
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of  introducing  a  computer  is  such  that  it  might  prove  uneconomical  to  let  the  machine  stand  idle,  so  that  the  provision 
of  a  rest-room  and  a  simple  kitchen  should  be  considered  at  an  early  stage  in  order  to  facilitate  shift  working. 

4.1  3  Back-up  facilities  -  Although  the  computer  aspects  of  ST1  handling  are  highly  specialised  in  many  respects,  means 
should  be  sought  of  carrying  out  as  many  operations  as  possible  on  an  alternative  machine  in  the  event  of  a  serious  break¬ 
down.  Subject  to  security  considerations,  this  objective  can  sometimes  be  achieved  by  setting  up  a  system  of  mutual  help 
between  the  ST1  organisation  concerned  and  a  research  establishment  or  similar  institution  which  has  a  computer  of 
suitable  size  and  of  the  same  manufacture. 

4.1.4  Interna!  reorganisation  -  When  the  main  constraints  of  budget,  scope  and  time-scale  have  been  laid  down  for  the 
computerised  system,  it  is  vital  to  set  up  procedures  for  regular  communication  between  the  newly  formed  Automatic 
Data  Processing  (ADP)  section  and  the  internal  groups  most  affected  by  the  inevitable  function  re-organisation  which  will 
take  place.  A  functional  chart  which  could  apply  to  an  Information  Centre  is  shown  in  Figure  8,  and  it  will  be  seen  that 
liaison  is  necessary  at  all  levels  of  management.  This  is  particularly  important  where  the  new  system  is  to  be  super¬ 
imposed  upon  well-established  procedures  with  the  object  of  ultimately  replacing  them.  If  the  first  output  from  the 
computer  can  be  shown  to  bring  about  an  obvious  improvement  to  an  existing  operation,  or  to  create  a  new  product 
which  fulfils  a  long-feit  need,  then  those  members  of  staff  whose  daily  work  is  to  be  affected  by  the  machine  will  be  more 
likely  to  take  a  sympathetic  view  of  subsequent  developments. 

4.2  Input  File  Structures 

It  has  already  been  pointed  out  that  the  computer  storage  of  bibliograpnic  records  which  include  abstracts  can  create 
problems  of  capacity  and  cost.  These  problems  become  greater  if  an  attempt  is  made  to  store  the  complete  text  of  reports, 
but  they  are  not  altogether  insoluble  given  adequate  resources,  and  advanced  systems  of  this  kind  will  be  referred  to  when 
considering  the  implications  of  information  retrieval  later  in  this  chapter. 

There  are  many  types  of  file  structure  which  can  be  used  to  hold  bibliographic  records  in  the  computer,  but  there 
are  two  basic  kinds  which  will  be  described  in  some  detail.  The  ideal  system  will  probably  incorporate  aspects  of  both  and, 
as  knowledge  and  expertise  develop,  refinements  such  as  data  compression  techniques  can  be  introduced. 

4.2  1  The  single  bibliographic  file  -  Provided  that  there  are  adequate  resources,  it  is  quite  possible  to  store  all  the 
bibliographic  information  which  is  to  be  processed  in  the  form  of  individual  records  in  a  single  main  bibliographic  file. 

The  research  report  literatur  processed  by  the  Technology  Reports  Centre  of  the  UK  Department  of  Industry  is  handled 
ii  this  way,  and  it  is  against  this  background  that  the  more  generalised  systems  described  in  this  chapter  are  defined.  A 
typical  record  is  shown  in  Figure  5,  and  it  can  be  seen  that  it  consists  of  twenty  fields,  including  the  complete  abstract, 
arranged  in  a  manner  which  is  dictated  by  the  need  to  display  the  various  elements  in  a  convenient  sequence  on  the  VDU 
screen  as  the  data  are  being  input.  The  average  length  of  such  records  is  about  1500  characters,  but  the  system  should 
allow  for  records  up  to  40'J0  characters  in  length. 

An  impoitant  factor  governing  the  sequence  of  the  fields  in  this  type  of  system  is  the  maximum  number  of 
characters  allowed  in  a  single  line  of  the  display,  remembering  that  each  space  ;s  counted  as  a  character.  Many  VDUs 
have  a  line  length  of  80  characters  so  that,  for  instance,  the  two  author  fields,  the  date  and  the  pagination  information, 
which  together  occupy  a  possible  maximum  of  79  characters  inclusive  of  spaces  will  fit  conveniently  into  a  single  line  of 
text  (see  Figure  9).  The  software  is  therefore  designed  to  accept  these  data  and  store  them  in  that  order.  On  the  other 
hand,  the  originator  field,  known  within  many  systems  as  the  corporate  author  field,  can  contain  up  to  150  characters 
but  rarely  exceeds  80.  The  software  will  therefore  handle  as  many  lines  as  each  field  requires  at  the  input  stage  and 
provide  the  correct  display  format  when  called  upon  to  do  so. 

Experience  within  TRC  has  proved  that  there  art;  no  serious  information  retrieval  problems  created  by  restricting  the 
number  of  authors  identified  in  the  input  file  to  two.  The  field  sizes  throughout  the  record  arc  rarely  found  to  be 
inadequate,  and  it  has  been  shown  that,  for  quick  reference  purposes,  most  reports  can  be  identified  by  the  first  thirty 
characters  of  the  title. 

Certain  fields  within  the  record  will  inevitably  contain  sub-fields,  i.e.  discrete  elements  within  the  field  which  must 
be  flagged  for  use  by  the  software  for  special  purposes.  The  indexing  terms  (descriptor)  field  is  constructed  in  such  a  way 
xhat  the  individual  terms  can  be  extracted  (see  paragraph  4.7.o)  and  listed  for  information  retrieval  purposes.  Each 
descriptor  is,  therefore,  a  sub-field  and  is  separated  from  its  neighbour  by  a  code,  the  oblique  (/)  in  Figure  9  which  is 
recognised  by  the  software.  In  the  example  shown  in  Figure  10  the  question  mark  (?)  forms  the  field  separator  and  the 
left-hand  square  bracket  (I)  is  the  record  separator.  The  decision  concerning  the  number  of  records  which  will  constitute 
an  incremental  file  will  depend  upon  «  wide  range  of  factors,  the  most  important  of  which  arc  the  average  number  of 
documents  passing  through  the  system  in  a  given  period  (say  a  week),  the  avenge  size  of  bibliographic  record,  the  average 
number  of  records  per  printed  page  of  the  announcement  journal,  the  required  frequency  of  journal  publication  and  the 
optimum  number  of  pages  per  incremental  issue  of  the  journal.  The  data  handling  capacity  of  the  computer  and  its 
peripherals  will  also  be  very'  significant,  but  the  adminis.  .Rive  decisions  involved  in  th-se  aspects  of  the  total  task  of  the 
ST!  system  will  have  been  taken  into  account  in  the  course  of  the  original  feasibility  study. 
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Fig.9  A  typical  abstract 
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T78-6188?  blrd-54i6??  3809? 

British  Library  Lending  Div., Boston  Spa, Wetherby, Yorks  LS23  7BQ? 
AUTHOR-PUBLISHER  RELATIONSHIPS  IN  SCHOLARLY  PUBLISHING??? 

Mann,  P.  II.??  02.1978?  83pp???? 

BLED?? 

With  the  help  of  live  scholarly  publishers,  seventy- two  books  were  examined 
from  the  viewpoint  of  both  authors  end  publishers  to  find  out  how  they  came 
to  be  written  and  published.  It  was  found  that  publishers'  editors  are 
themselves  very  active  in  seeking  ideas  for  new  books  ana  the  scholars  who 
act  as  their  series  editors,  consultants  and  advisors  form  a  very  important 
communication  network  linking  now  authors  to  publishers.  Early  contact  with 
this  network  con  help  authors  greatly  and  save  unnecessary  work  and  possible 
disappointment . ? 

♦Publishing/* Authors/Education/Books//05B/?  [ 

Fig.  1 0  Bibliographic  record  displayed  with  control  codes 


bach  incremental  main  bibliographic  file  is  terminated  by  a  unique  code,  say  a  double  left  hand  .square  bracket  (II). 
and  is  handled  as  an  entity  in  the  system  right  through  to  the  stage  at  which  it  is  used  for  the  printing  o'"  text  and  indexes 
I  lie  data  fields  and  control  codes  arc  then  subjected  to  special  software  which  converts  the  Hie  to  the  formats  required  by 
the  data  base  programs  and  the  incremental  file  becomes  pari  of  the  total  data  base  system. 

■12. 2  Parallel  files  A  useful  alternative  to  the  main  bibliographic  file  is  the  system  of  parallel  files.  The  simplest 
application  of  this  system  is  to  hold  all  fields  except  the  abstract  in  one  file  (the  Reports  File)  arid  to  hold  abstracts  only . 
identified  by  the  m-house  document  accession  number,  in  a  parallel  file  (the  Abstracts  File)  The  Reports  File  will  be 
quite  compact  with  each  record  containing  200  to  *100  characters  m  the  system  shown  in  Figure  1 1 ,  and  it  will  have 
many  uses  within  the  total  system,  particularly  for  ‘housekeeping’  and  other  management  purposes,  c  g  the  control  of 
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document  issues  and  periodic  statistics  on  work  progress.  Printing  and  display  programs  which  call  for  abstracts  will  need 
to  have  the  capability  of  linking  in  with  the  Abstracts  Hie  when  complete  bibliographic  records  are  required. 

The  need  for  absolute  piecision  on  the  part  of  the  keyboard  operators  who  create  the  input  file  is  particularly  signi¬ 
ficant  in  the  case  of  those  fields  which  hold  reference  number  information,  since  a  mis-quoted  number  can  make  a 
document  irretrievable  within  the  system.  Tor  systems  employing  punched  cards  or  punched  paper  tape  as  input  it  is 
usual  to  key  the  data  twice  using  different  operators  and  then  to  compare  the  two  sets  of  codes  automatically. 

The  systems  being  described  here,  however,  are  based  upon  direct  input  from  VDU  terminals  with  single  keying,  it 
is  then  feasible  to  write  alpha  numeric  and  character  counting  checks  into  validation  software  to  protect  the  input  against 
errors  in  fields  which  are  susceptible  to  inaccuracies  of  this  kind.  Those  fields  which  contain  textual  data  car.  only  be 
properly  corrected  by  proof  reading  and  text  editing.  Fields  which  form  the  main  entries  to  indexes  are  particularly 
sensitive  to  keying  errors,  and  this  is  especially  true  of  the  Corporate  Author  index  based  upon  the  Originator  field.  The 
slightest  textual  error,  such  „s  misquoted  abbreviations  or  the  use  of  incorrect  punctuation  will,  if  undetected,  cause  the 
computer  to  create  a  new  heading  identical  in  all  other  respects  with  the  proper  heading,  thus  impairing  the  accuracy  and 
reliability  of  the  index.  There  is  a  strong  case,  therefore,  for  establishing  a  master  file  of  originators  as  an  authority  file 
cross-referenced  to  the  file  of  bibliogiaphic  infoimation,  so  that  the  keyboard  operator  needs  only  to  insert  a  reference 
number  in  the  originate  field  of  the  input  file,  leaving  the  software  to  pull  out  the  corresponding  corporate  author 
information  from  the  master  file  when  a  complete  record  is  required. 

A  six-figure  number,  representing  up  to  999  999  different  corporate  authors,  is  considered  sufficient  for  indexing 
purposes  in  the  parallel  file  system,  and  it  is  far  easier  to  apply  a  visual  check  to  such  a  number  than  it  is  to  proof-read  the 
complete  corporate  author  identity  with  its  array  of  capital  letters,  special  punctuation  and  abbreviations. 

The  system  of  parallel  files  could  be  extended  to  cover  personal  authors  also,  but  such  developments  can  generally 
be  regarded  as  refinements. 

4.3  Storage  File  Structures 

The  software  which  is  used  to  count  the  data  fields  and  to  validate  such  fields  as  the  accession  number,  main  subject 
code  and  date  in  the  single  main  bibliographic  file  system  can  be  extended  to  re-sort  the  data  elements  when  they  have 
been  corrected  and  to  eliminate  unwanted  spaces  from  the  record.  Figure  12  shows  a  typical  storage  Tile,  arranged  with 
the  main  sort  Fields  (main  subject  code,  accession  number,  originator's  reference  number  and  related  document  numbers) 
at  the  top  of  each  record.  In  most  cases  the  first  operation  involving  the  storage  file  is  the  printing  out  of  an  increment 
of  the  semi  monthly  announcement  journal,  and  a  sequential  number  (record  numberj  is  generated  by  the  software  for 
use  as  a  control  within  the  suite  of  programs  which  creates  the  printed  output  for  each  increment.  The  record  number  is 
not  normally  displayed. 

Systems  which  employ  parallel  files  for  the  storage  of  data  can  only  operate  efficiently  if  the  data  elements  which 
make  up  the  records  are  enhanced  by  the  addition  of  special  control  codes  (keys)  and  links  at  tiie  data  preparation  stage. 
The  software  which  controls  the  files  will  then  recognise  the  keys  and  links  and  use  them  as  pointers  to  the  data  elements 
in  the  various  files  when  it  is  necessary  to  assemble  records  and  display  them  in  specific  formats. 

4.4  Output  File  Structure  (Fig.  1 3) 

The  principle  function  of  the  Output  File  in  STI  systems  is  to  display  bibliographic  information  either  on  the  VDU 
screen  or  the  printed  page.  The  structure  of  the  VDU  displays  will  be  described  in  paragraph  4.5  (Data  Preparation),  and 
the  structure  of  the  print  file  used  to  create  the  pages  of  a  typical  announcement  journal  will  be  referred  to  m  paragraph 
4  18  on  printing.  A  variety  of  other  output  formats  is  required  for  data  validation  purposes  and  to  provide  brief  details 
of  the  documents  passing  through  the  system  for  various  housekeeping  functions,  so  that  it  is  important  to  ensure  that 
control  codes  are  retained  within  the  text  to  provide  keys  for  data  selection  and  display.  Programs  can  then  be  provided 
to  list  any  individual  fields  or  combination  of  fields  for  work-flow  contiol  purposes,  e.g.  from  accession  numbers  only  to 
the  main  bibliographic  record  complete  with  the  symbols  which  represent  the  control  codes.  Tins  latter  feature  is 
particularly  important  in  a  system  which  prepares  output  on  magnetic  or  punched  paper  tapes  for  photo-typesetters. 

4.5  Data  Preparation  Systems  -  General 

Reference  has  already  been  made  to  the  use  of  punched  cards  and  punched  paper  tape  as  input  io  STI  systems,  and 
there  are  specific  areas  in  which  these  media  can  be  employed  to  advantage.  The  modem  in-house  computer  generally 
lends  itself  to  input  via  visual  display  units  however,  and  there  are  great  benefits  to  be  gained  from  the  instant  display  ol 
text  on  the  screen  as  the  data  are  being  captured  in  machine  readable  form.  The  same  bioad  rules  governing  the  structure 
of  the  data  elements  within  the  record  and  the  structure  of  the  records  within  the  file  will  apply  whatever  data  prepara¬ 
tion  technique  is  employed,  but  the  trend  is  towards  entering  data  direct  from  VDU  terminals,  and  it  is  the  essential 
features  of  this  type  of  system  which  will  be  described  in  detail. 


70 


Field 

Number 

Maximum 
Field  Size 
(  Characters) 

Contents 

0 

4 

Record  Count  (Total  number  of  characters  in  record) 

1 A 

4 

Record  Number 

1 

4 

Mam  Subject  Code 

2 

30 

Accession  Number 

3 

25 

Originator’s  Kefsreiur  Number 

4 

23 

Agency  Reference  Number 

5 

25 

Project  Number 

6 

25 

Other  Reference  Number 

7 

25 

Fv,  riher  Referer.ee  Number 

S 

29 

Contract  Number  and  Period  Reference 

9 

9 

Date  of  Report 

10 

P 

Pages  and  References 

11 

25 

First  Author 

12 

25 

Second  Author 

13 

25 

Availability  and  Price 

I-* 

5 

Increment  Number 

15 

150 

Originator 

16 

200 

Title  in  English 

17 

300 

Title  in  Original  Language 

18 

300 

Conference  Details 

19 

2091 

Abstract 

20 

565 

Indexing  Terms 

20A 

1 

Record  Terminator 

Fig.  1  2  Typical  storage  file  structure 


Field 

Number  Con  Mils 

0  Record  Count  (Total  number  of  characters  in  record) 

1  Main  Subject  Code 

2  Accession  Number 

3  Originator’s  Reference  Number 

4  Agency  Reference  Number 

5  Project  Number 

6  Other  Reference  Number 

7  Further  Reference  Number 

8  Contract  Number  and  Period  Reference 

9  Date  of  Report 

10  Pages  and  References 

1 1  First  Author 

12  Second  Author 

13  Availability  and  Price 

14  Increment  Number 

1 5  Originator 

1(>  Title  in  English 

17  Title  in  Original  Language 

18  Conference  Details 

1 9  Abstract 

20  Indexing  Terms 

20A  Record  Terminator 

Fig.  1 3  Output  file  structure 


4.6  cCey-to-di.se  Input  from  Terminals  Linked  to  an  in-house  Computer 

A  typical  key-to-disc  data  prepration  system  is  illustrated  in  Figures  14  and  15. The  "Data  Prep  Room  in  which  the 
input  VDIJ  terminals  are  housed  should  be  carpeted,  quiet  and  comfortable  to  provide  the  operators  with  working  condi¬ 
tions  in  which  they  can  carry  out  their  very  exacting  work  with  the  minimum  of  distraction.  It  is  particularly  important 
to  arrange  the  lighting  in  such  a  way  that  eye-strain  is  eliminated.  Ideally  the  Data  Prep  Room  should  be  near  enough  to 


Summary  of  the  data  preparation  and  abstract  journal  production  systems  at  TRC 
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the  main  Compute  Room  to  enable  the  VDU  terminals  to  be  wired  directly  into  the  machine.  Where  the  document 
flow  is  sufficient  to  justify  the  use  of  a  duster  of  VDU’s  it  is  a  good  principle  to  arrange  the  terminals  in  pairs  so  that  two 
adjacent  operators  can  take  it  in  turn  to  check  each  others’  work. 

The  recording  of  the  bibliographic  data  defining  each  document  which  enters  the  system  is  controlled  by  means  of 
a  Data  Lntry  Form  or  Document  Processing  Sheet  (see  Figure  16).  The  preparation  of  the  form  can  be  entirely  manual 
or  partly  by  computer  as  described  in  paragraph  4.22  which  deals  with  housekeeping  and  stock  control.  The  form  travels 
to  the  Data  Prep  Room  with  the  parent  document  and,  to  minimise  errors  of  transcription,  as  much  bibliographic 
information  as  possible  is  copied  by  the  operator  direct  from  the  title  page  of  the  report.  Each  day's  Input  is  treated  as 
a  unit  for  control  purposes,  and  the  Daily  Input  rite  is  built  up  on  disc  as  the  complete  bibliographic  data  defining  each 
document  are  keyed  in. 

4.7  The  Data  Fields 

Referring  to  Figure  5,  it  can  be  seen  that  most  of  the  data  fields  are  self-explanatory.  The  general  considerations 
which  govern  the  structure  of  the  data  within  the  Input  File  are  referred  to  in  paragraph  4.2,  but  some  of  the  fields  have 
special  characteristics  which  are  worthy  of  note. 


4. 7.1  Field  1  Mam  Subject  Code.  This  is  the  three-character  code  assigned  by  the  document  analyst  at  the  abstracting 
stage.  The  code  is  cross-referenced  by  the  software  to  a  parallel  file  which  contains  the  master  list  of  the  subject  headings 
and  sub-headings  which  has  been  adopted  by  the  ST1  system  a»  a  whole.  The  list  of  COSATI/NTIS  codes  has  been  found 
to  be  dependable  in  this  context,  and  the  brief  but  precise  statement  of  subject  identity  which  constitutes  Field  1  will  be 
found  to  have  a  variety  of  uses  in  the  STI  system  (see  notes  on  Field  20). 


4. 7.2  field  2  -  Accession  Number.  This  is  the  number  given  to  the  document  when  it  becomes  part  of  the  total  store 
of  research  report  literature.  It  is  usual  to  employ  a  system  which  numbers  the  documents  sequentially  within  the  year 
of  acquisition,  so  that  each  annual  series  begins  with  the  year’s  identity  followed  by  document  Number  One. 

4. 7.3  FieldS — Increment  Number.  It  can  be  assumed  that  the  original  Feasibility  Study  (see  paragraph  4. 1 . 1 )  or 
existing  practice  will  decide  the  size  and  frequency  of  the  issues  of  the  STI  system’s  announcement  journal.  The  contents 
of  each  Issue  of  the  journal  will  form  an  increment  of  the  total  throughput,  and  the  decision  concerning  the  number  of 
issues  to|bC  published  lira  year  will  depend  upon  the  quantity  of  reports  handled,  the  rate  at  which  they  are  processed, 
the  optimum  journal  size,  etc.  Twelve  monthly  issues  may  conveniently  constitute  an  annual  volume  and  be  numbered 
accordingly,  but  it  is  advisable  to  introduce  multiple  volumes  in  a  year  if  more  than  twelve  issues  are  to  be  published  in 
that  timj^.— A-convement-arrangemen^-is-to  assign  twenty-four  or  twenty -six  issues  in  a  year  to  two  volumes  each  of  twelve 
or  thirteen  increments,  anjd  the  increment  number  shown  at  the  top  right  in  the  example  (Tig.  1 0)  would  be  used  in  such 

a  systeni(lS051jsJncrentent  2_o.fVo.Lume  ^.8). 

4. 7.4  Fields  10  and  lit  First  and  Second^ Author.  These  fields  are  formatted  to  suit  the  requirements  of  the  software 

which  c(eates-the~Author ilndex, -i.e-the-surname. first  for  ease  of  sorting  into  alphabetical  order.  There  need  be  no  restric 
tion  on  file  number  of  initials  allowed  provided  that  the  total  permitted  field  size,  including  spaces  and  punctuation,  is 
not  excdededT"  '  !  '  ! 

4. 7.5  fieldJJ  -  Date.  There  are  many  ways  in  which  dates  are  traditionally  expressed,  and  the  method  shown  in 
Figure  TO  is  a  useful  compromise.  Sufficient  dating  precision  is  achieved  by  quoting  only  month  and  year,  and  the  all 
n umeric*presen ta tion-is  clear  and  relatively  easy  for  checking  by  means  of  the  validation  software. 


4. 7.0  field  20  indexing  Terms.  These  can  be  in  the  form  of  keywords  (alpha)  or  subject  codes  (generally  alpha/ 
numeric)  and  the  software  should  be  designed  to  take  accou:  of  multi-word  terms,  provided  that  they  do  not  exceed  the 
inaximuiicleagta allowed. for_the.indi.vidual.descnpl.ors..A.Uia,\imum  length  of  forty  characters  has  been  found  to  be 
generally  acceptable,  only  ptesentingiproblems  to  the  document  analyst  when  lie  is  dealing  with  complex  chemical 
compoundsj  Field  20ns  divided  into  sub, fields  (groups  of  descriptors)  by  a  main  sub-field  separator  which  is  a  double 
oblique  t//j  in  the  example_$hown  in  Figure  10,  and  into  sub-sub-fields  (individual  descriptors)  by  the  secondary 
separator  which  is  shown  as  a  single  oblique  (/).  This  arrangement  permits  the  software  to  separate  the  keywords  and  the 
subject  codes-for-hsting  purposes and,.at.the.sam6  tune,.to.givc.equaI  status  to. every  term  used,  be  it  alpha,  alpha, 'numeric 
or  numeric,  for  information  retrieval  purposes.  The  printed  Subject  Index  will  be  based  upon  the  terms  used  in  Field  20, 
and  it  has  been  found  convenient  to  designate  some  or. all  of  the  First  five  descriptors  as  Subject  Headings  The  Indexing 
Terms  field  begins  therefore  with  up  to  five  descriptors  which  arc  asterisked  to  enable  the  indexing  software  to  recognise 
them  and  copy  them  over  to  a  special  file  for  use  as  headings  in  the  Subject  Index.  Thus  they  are  used  both  for  informa 
tton  retrieval  and  for  printing  purposes.  There  are  other  special  kinds  of  descriptor  such  as  the  “Identifier”  which  has 
value  for  retrieval  purposes  in  relation  to  a  specific  report,  but  is  not  required  in  the  thesaurus.  In  the  example  shown  in 
Figure  9,  identifiers  are  marked  with  a  cross  which  enables  them  to  be  given  special  treatment  by  the  processing  software 
Where  the  COSAFI/NFIS  system  of  subject  classification  is  adopted,  the  first  alpha  numeric  code  following  the  sub  field 
separator  is  designated  as  the  Mam  Subject  Code  and  is  copied  to  the  top  of  the  record  (T ield  1  in  the  example  shown) 
where  it  exercises  important  control  functions  in  the  system  software  (see  paragraph  4.7. 1 ). 


I  2.  Accciilon  Number 


DATA  ENTRY  FORM 

DOCUMENT  PROCESSING  SHEET 


I  1.  Conti  Reference 


.1  Originator*  Reference 


'  4  Agency  Ref  or  Marking  on  Document 


6 

Originator 

L 

Title 

9 

Conference:  Title.  Place  and  Date 

10 

Futt  Author 

11  Second  Author 

12.  Date 

13  Pages  &  Refs 

) 

>  PP  ref  ? 

14 

Contract  No 

> 

15  Project  No 

i 

IP  Other  Ref 

) 

n 

A v. (lability  &  Price 

18  Further  Ref. 

> 

> 

IS 

ABSTRACT: 

} 

120  indexing  Term* 


Received 

Inf  Mali |  Date 


InltUli  I  Date 


Checked 

InltUli  I  Date 


No  of  <  opiei 


Fig.  1 6  Document  processing  sheet  TRC  1  ‘>78 
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4.8  Separator  Codes 

It  is  essential  that  the  (.odes  selected  to  operate  as  sub  field,  field,  record  and  file  terminators  should  never  appear  as 
characters  m  the  text  of  the  bibliographic  record.  A  typical  function  of  the  sub  field  separator  (,')  has  been  described  in 
paragraph  4.7.0,  and  the  system  shown  in  Figure  10  uses  the  question  mark  (?)  as  field  separator.  Records  and  files  are 
terminated  by  a  single  left-hand  bracket  (band a  double  left-hand  bracket  ([[)  respectively,  and  care  should  be  taken 
when  choosing  these  very  significant  codes  to  ensure  that  they  can  be  printed  by  the  system  for  validation  purposes  and 
that  they  do  not  have  a  unique  function  in  the  system  software  provided  by  the  computer  manufacturer.  A  well-designed 
validation  program  will  test  for  the  presence  or  absence  of  separator  codes,  and  will  reject  any  records  in  which  these 
codes  are  improperly  used.  In  the  system  chosen  for  the  purpose  of  illustration,  a  record  having  data  in  all  twenty  data 
fields  is  a  very  rare  occurrence  but,  nevertheless,  every  record  must  contain  twenty  field  separators  in  the  correct 
sequence  if  it  is  to  be  accepted.  The  presence  of  adjacent  question-marks  indicates  that,  for  the  record  shown  in  Figure 
10  there  is  no  information  present  in  those  particular  fields. 

4.9  Keying-in,  Validating  and  Correcting  Data 

Textually  accurate  input  is  essential  for  the  creation  of  consistent  output  which  will,  in  turn,  provide  the  user  with  a 
good  impression  of  the  dependability  of  the  STi  system  as  a  whole.  It  can  be  deduced  from  Figure  14  that  the  data  can 
be  validated  seven  times  under  the  TRC  system,  twice  at  the  VDU  as  the  data  are  being  captured,  twice  by  software  using 
the  Validation  Program,  and  three  times  from  text  in  the  form  of  computer  print  outs  which  can  be  created  each  time  a 
sorted  input  file  has  been  set  up.  It  is  essential  that  the  text  editing  software  should  allow  items  of  bibliographic  informa 
tion  to  be  called  back  to  the  VDU  screen  from  the  Input  File  in  response  to  keying  in  the  appropriate  accession  number. 
An  acceptable  working  system  could  be  based  upon  a  2000-character  VDU  screen,  an  Input  Hie  made  up  of  complete 
bibliographic  records  of  up  to  4000  characters  each  and  a  screen  format  created  by  the  keyboard  operator  to  resemble 
the  layout  of  records  on  the  ultimate  printed  page.  The  keyboard  operator,  working  from  the  Document  Processing 
Sheet  (Fig.lo)  and  the  parent  document,  enters  the  data  in  the  correct  sequence,  observing  the  rules  governing  the  format 
of  each  field,  and  inserts  control  codes  as  required.  At  the  end  of  the  record,  or  having  filled  the  screen  with  data  in  cases 
where  the  record  exceeds  2000  characters  in  length,  the  operator  reads  the  contents  of  the  screen  and  corrects  obvious 
textual  errors  by  use  of  the  edit  features  of  the  data  prep  software.  Having  completed  this  first  validation,  the  operator 
clears  the  screen  by  depressing  the  “SEND”  key  which  consigns  the  data  to  the  Input  File.  It  is  important  to  ensure  that, 
for  records  in  excess  of  2000  characters,  the  last  line  of  text  is  retained  in  the  VDU  buffer  and  displayed  at  the  top  of  the 
otherwise  blank  screen  when  the  operator  prep..res  to  complete  the  item  by  keying  in  a  second  “page”  or  screen  full  of 
data.  This  line  of  text  is  frequently  part  of  the  abstract,  and  this  retention  technique  enables  the  operator  to  maintain 
continuity  without  risk  of  error.  The  Input  Program  will  be  designed  to  make  the  data  from  the  two  “pages"  completely 
contiguous  in  the  file.  Systems  employing  VDU's  with.  say.  a  1280  character  capacity  (16  lines  each  of  80  characters), 
might  have  to  accept  more  than  two  screen-fulls  of  data  per  record,  depending  upon  the  maximum  record  size  allowed 
for  at  the  system  design  stage. 

Having  incorporated  one  validation  operation  into  the  initial  data  capture  process,  the  second  validation  is  carried 
out  «t  the  terminal  by  pairs  of  operators  working  as  partners.  It  has  already  been  recommended  that  input  terminals  be 
arranged  in  pairs  and,  if  two  operators  spend  the  first  half  of  the  working  day  independently  keying  (resl.  items  into  the 
Input  File,  they  can  exchange  the  morning's  source  documents  and  spend  the  second  half  of  the  day  checking  each 
other’s  work.  Fins  is  done  by  calling  b.n  k  the  records  from  the  Input  File  by  their  accession  numbers  and  pioof  reading 
from  the  screen.  The  day's  work  is  then  submitted  to  the  Validation  Program  in  the  computer  which,  in  a  text  handling 
system,  can  only  carry  out  elementary  checks  of  the  kind  indicated  in  paragraph  4.10.  Records  which  are  not  acceptable 
to  the  program  are  rejected  completely  and  the  related  documents  are  set  aside  to  be  keyed  in  afresh  with  the  next  day's 
work. 

To  provide  the  means  of  carrying  out  the  next  two  stages  of  data  validation  it  is  necessary  to  be  able  to  print  out  the 
day's  work  on  two-part  continuous  stationery,  preferaoly  using  a  fast  printer  which  works  in  upper  and  lower  case, 
printing  all  control  codes.  Fhe  data  prep  supervisor  then  reads  over  one  copy ,  looking  mainly  for  consistency  of  sty  lc  and 
recon!  content,  while  the  editor  of  the  announcement  journal  checks  the  other,  ensuring  the  accuracy  of  layout  and 
technical  content  and  also  examining  the  consistency  of  indexing  practice,  etc.,  on  the  part  of  the  information  scientists 
who  analyse  the  original  documents. 

I  igure  14  refers  to  a  system  which  creates  twenty-four  issues  of  an  announcement  journal  in  two  volumes,  each  of 
twelve  issues,  in  a  year.  Fins  system,  therefore,  is  based  upon  a  semi  monthly  production  cy  cle,  and  the  output  of  the 
various  related  groups  within  the  parent  organisation  is  gauged  to  fit  within  this  requirement.  The  implications  of  this 
aspect  of  STI  processing  are  dealt  with  in  detail  m  paragraph  4.1b  but  they  are  referred  to  at  this  point  in  order  to 
emphasize  the  need  for  co-ordinating  the  various  activities  which  make  up  the  overall  work  pattern.  At  the  end  of  each 
data  prep  cy  cle  a  time  is  allowed  for  keying-in  the  final  corrections  and  the  incremental  input  file  is  then  run  against  the 
validation  program  which  will  list  and  reject  any  items  which  it  cannot  accept.  The  next  stage  is  to  print  out  the 
complete  increment  on  the  high-speed  printer  and  submit  it  to  the  journal  editor  who  will  see  that  the  corrections  which 
he  has  previously  requested  have,  u>  fact,  been  made.  The  instructions  can  then  be  given  to  print  the  pages  of  the  journal 
or  to  create  the  magnetic  tape  which  will  drive  the  photo-typesetter. 
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Sever,  stage  validation  may  seem  excessive,  but  the  nature  of  the  task  is  such  that  opportunity  must  be  given  to 
check  that  those  errors  which  have  been  discovered  have  actually  been  corrected.  It  is  only  necessary  foi  the  keyboard 
operators  and  the  data  prep  supervisor  to  proof-read  every  word  of  the  text  in  a  system  of  this  kind. 

4.50  Operator  Aids 

4,10.1  'Free  7)  ping'M  hie  The  data  prep  system  so  far  described  only  enables  the  computer  to  assist  the  operator  at 
the  validation  stage,  relying  almost  entirely  upon  human  skill  and  expe  ionce  to  create  records  which  are  correct  in  every 
detail.  It  is  a  characteristic  of  text  handling  systems  that  the  aids  which  are  provided  by  the  computer  can  only  be 
minimal  and,  in  the  TRC  s,  tern  for  instance,  they  are  applied  to  the  main  subject  code,  accession  number,  increment 
number,  author,  date  and  indexing  terms  fields.  All  of  these  except  the  author  fields  are  checked  for  length  (individual 
descriptors  will  have  a  maximum  length,  and  the  other  fields  listed  above  will  have  a  fixed  length)  and  all  except  the 
indexing  terms  field  have  a  fixed  format  which  can  be  checked  by  program.  Each  complete  record  is  scanned  to  ensure 
that  the  correct  number  of  field  separators  is  present,  that  there  are  no  superfluous  spaces  in  the  text,  and  that  the  control 
codes  are  only  placed  where  they  arc  required  to  be.  The  software  incoroorates  a  repertoire  of  error  messages  which  give 
the  operator  brief  details  of  any  errors  found  by  the  computer,  and  the  program  rejects  faulty  records. 

4  10.2  'Controlled  Typing' Mode  The  operator  aids  described  in  paragraph  4. 1 0. 1  apply  to  a  system  which  is  based 
upon  traditional  typing  procedures.  Such  systems  offer  two  advantages  in  that,  by  relying  on  the  skill  and  accuracy  of 
the  keyboard  operator,  the  input  control  software  can  be  greatly  simplified  and  the  operator  is  given  a  better  degree  of 
job  satisfaction,  generally  taking  pride  in  getting  things  right.  It  is  quite  feasible,  however,  to  set  up  a  system  which 
eliminates  all  formatting  requirements  at  the  input  stage  and  expects  the  operator  to  simply  key  in  strings  of  data  in 
response  to  prompts  on  the  VDU  screen.  A  controlled  typing  system  of  this  kind  might  display  the  names  of  the  various 
data  fields  in  “inverse  video",  i.e.  dark  characters  on  a  white  background  set  up  in  a  local  area  of  the  screen,  and  provide 
advice  concerning  the  inclusion  of  control  codes  (separators,  etc)  in  the  text  as  it  is  keyed  in.  The  validation  software, 
which  must  be  running  continuously,  can  be  designed  to  apply  more  extensive  checks  than  are  feasible  for  the  free  typing 
mode,  and  the  detection  of  an  error  might  be  signalled  by  causing  the  label  of  the  field  concerned  to  flash  on  the  screen. 
Systems  based  upon  controlled  typing  of  this  kind,  which  fill  the  input  file  with  unfoi  matted  strings  of  data,  require  very 
efficient  formatting  software  to  facilitate  proof  reading  and  to  ensure  that  the  bibliographic  records  can  be  displayed  to 
suit  the  required  page  layout  in  the  announcement  journal. 

4.11  Indexes 

Three  main  forms  of  index  are  generally  required  for  an  STI  system  which  prints  a  journal  and  operates  a  data  base. 
These  consist  of  internal  machine  indexes  to  facilitate  processing,  printed  indexes  for  publications  and  subject  indexes 
stored  in  the  computer  for  information  retrieval  purposes. 

4  11  1  MiiJiith  indexes  The  internal  indexes  used  as  aids  in  processing  the  data  are  normally  invisible  to  the  operator, 
functioning  only  within  the  system  software.  As  records  are  added  to  the  mam  disc  file  a  disc  index  is  built  up  auto¬ 
matically  ,  relating  each  accession  number  to  the  disc  address  at  vv  Inch  its  associated  bibliographic  information  is  to  be 
found.  The  data  files  require  frequent  sorting  and,  by  use  of  these  indexes,  it  is  only  the  accession  number, address  file 
which  needs  to  be  sorted,  thus  effecting  economies  in  time  and  resources.  When  it  is  necessary  for  the  complete  records 
to  be  printed  or  displayed,  the  software  refers  to  the  sorted  disc  index  file  which  provides  pointers  to  the  required 
bibliographic  data  in  the  unsorted  main  bibliographic  file  (see  Figure  14). 

4  1 1.2  Printed  indexes  Figure  1 7  shows  a  system  of  parallel  files  used  for  the  creation  of  printed  indexes  for  publica¬ 
tions.  A  typical  issue  of  an  announcement  journal  would  be  indexed  by  subject,  author,  report  number  and  accession 
number,  w  ith  an  incremental  subject  code  index  for  control  purposes.  \\  hen  volume  indexes  arc  produced,  combining  the 
incremental  indexes  over,  say ,  six  months  or  a  year,  the  repertoire  might  be  extended  to  include  title,  corporate  author 
and  conference  indexes. 

It  is  important  to  observe  that,  when  the  input  to  the  system  is  being  treated  as  a  senes  of  data  fields  in  software 
terms,  it  is  an  alpha, ’nuni'Tic  sort  based  on  the  Main  Subject  Code  field  at  the  beginning  of  each  record  which  enables  the 
records  to  be  placed  in  their  subject  groups.  It  is  this  operation  which  sets  up  that  part  of  the  system  which  will  display 
the  bibliographic  file  for  each  increment  in  printed  page  sequence.  The  subject  codes  are  listed  in  the  contents  pages  at 
the  begirtnning  of  each  issue,  along  with  the  numbers  of  the  pages  upon  which  they  can  be  found.  To  locate  a  complete 
report  reference  of  which  he  knows  the  accession  number,  the  usci  needs  only  to  be  additionally  aware  of  its  main  subject 
heading  and  the  number  of  the  increment  of  the  abstract  journal  in  which  the  item  was  announced,  and  lie  is  quickly 
pointed  to  the  page  or  group  of  pages  in  which  it  must  appear  (see  Figure  18).  The  abstracts  arc  arranged  on  the  printed 
pages  in  accession  number  order  within  subject  group. 

The  report  title,  its  main  subject  code  and  the  related  increment  number  are  the  three  data  fields  commonly  required 
in  every  set  of  printed  indexes,  although  it  may  be  considered  that  the  title  is  redundant  in  the  Report  Nuinbci  Index 
which  simply  lists  all  the  numbers  which  refer  to  a  specific  report  against  the  main  accession  number  of  that  report. 

Again,  sorting  takes  place  only  on  a  smaller  file  when  setting  up  the  various  indexes  (see  Figure  1 7 ).  Each  index  file  is 
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Fig.  i  7  A  typical  journal  index  system 
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CONTENTS 

Page 


1.  AERONAUTICS 

OlA  Aerodynamics  1 

01C  Aircraft  1 

OID  Aircraft  Flight  Control  and  Instrumentation  5 

2 .  AGRICULTURE 

02A  Agricultural  Chemistry  6 

02C  Agricultural  Engineering  7 

02D  Agronomy  and  Horticulture  7 

4 .  ATMOSPHERIC  SCIENCES 

04A  Atmospheric  Physics  7 

5 .  BEHAVIORAL  AND  SOCIAL  SCIENCES 

05A  Administration  and  Management  8 

05C  Economics  9 

05E  Human  Factors  Engineering  11 

05H  Man-machine  Relations  11 

051  Personnel  Selection,  Training,  and  Evaluation  12 

05 J  Psychology  (Individual  and  Group  Behavior)  13 

05K  Sociology  14 

6.  BIOLOGICAL  AND  MEDICAL  SCIENCES 

06A  Biochemistry  15 

05 B  Bioengineering  15 

06C  Bicic.ry  16 

06D  Bionics  17 

06 E  Clinical  Medicine  18 

OSH  Food  18 

06M  Microbiology  18 


Fig.  1 8  Sample  of  a  contents  page  (R  &  D  abstracts  based  on  COSAT1  subject  codes) 


created  automatically  from  the  main  bibliographic  file,  and  contains  only  the  principal  index  elements  (the  authors’ 
names  for  the  author  index,  the  report  titles  for  the  title  index,  etc)  and  the  disc  addresses  at  which  the  remainder  of  the 
data  fields  required  to  make  up  each  index  are  to  be  found.  When  sorting  is  complete  the  General  Index  File,  containing 
all  the  index  information  in  unsorted  form,  is  run  in  association  with  the  sorted  file  of  selected  index  elements  to  produce 
the  required  print-out.  The  suite  of  computer  programs  which  creates  the  printed  indexes  for  each  issue  of  the  announce¬ 
ment  journal  can  be  so  arranged  that  the  programs  for  the  individual  indexes  are  linked  together  to  provide  a  continuous 
printing  operation  and,  at  the  end  of  the  run,  the  incremental  General  Index  File  is  retained  for  archival  purposes.  A 
number  of  these  incremental  files  are  combined  when  it  is  necessary  to  create  cumulated  indexes,  but  the  same  software 
can  be  used,  allowing  time  for  the  larger  sorting  and  printing  operations,  so  that  the  appearance  of  the  index  pages 
remains  unchanged. 

4. 1 13  Indexing  for  Information  Retrieval  -  Software  Features.  The  systems  of  indexing  used  for  information  retrieval 
fall  into  two  categories  -  those  used  for  tracing  reports  within  the  stock  control  files  and  those  used  for  subject  searches. 
The  latter  make  use  of  keywords  and  subject  codes  for  locating  report  references  within  the  files  of  bibliographic  informa¬ 
tion  which  constitute  the  data  base.  The  stock  control  files  are  described  in  paragraph  4.22  and  piocedures  for  informa¬ 
tion  retrieval  by  subject  are  generally  based  upon  inverted  files  of  the  kind  described  in  Section  7,  Vol.UI  of  the  Manual. 
The  inverted  files  used  in  this  context  can  be  briefly  defined  as  bibliographic  reference  files  in  which  the  keyword,  and 
not  the  record  number,  identifies  the  record.  Thus  a  basic  inverted  file  for  subject  retrieval  would  consist  of  a  list  of  the 
keywords  used  in  the  system,  with  each  keyword  followed  by  a  set  of  reference  numbers  defining  the  individual  records 
to  which  that  keyword  has  been  allocated.  It  is  essential  that  the  data  prep  rules  concerning  the  structure  of  the  key¬ 
words  and  the  subject  codes  used  for  retrieval  purposes  be  carefully  observed,  and  that  the  separator  symbols  and  other 
control  codes  in  the  Indexing  Terms  field  (Fig.10)  be  correctly  inserted  in  the  records. 

The  retrieval  of  report  references  using  the  Indexing  Term  field  is  relatively  straightforward.  The  inverted  index  files 
belonging  to  the  data  base  are  updated  as  each  increment  of  the  announcement  journal  is  added  to  the  store  (see  Figure  19) 
so  that,  against  each  descriptor  in  the  file,  there  is  a  pointer  to  the  location  of  every  posting,  i.e.  each  report  reference  in 
the  system  to  which  that  particular  descriptor  has  been  allocated.  The  subject  interest  expressed  by  the  user  is  defined 
in  the  form  of  a  list  of  descriptors  linked  by  Boolean  logic  ('and',  ‘or’,  ‘not’),  and  this  search  statement  is  run  against  the 
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Fig.  1 9  Creation  and  update  of  database  Tiles 
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inverted  file.  When  a  match  is  obtained  between  a  search  term  and  a  term  in  the  file,  the  software  codes  which  act  as 
pointers  to  the  report  references  linked  to  tiie  file  term  are  stored  as  a  set  within  the  user  file  created  by  the  computer  for 
the  search,  and  the  total  number  of  these  ‘hits’  is  displayed.  The  sets  are  manipulated  by  the  computer  according  to  the 
‘and’,  ‘or’,  ‘not’  logic  included  in  the  search  statement  and,  as  the  search  proceeds,  the  user  can  judge  the  success  or  other¬ 
wise  of  his  search  by  the  number  and  quality  of  the  hits  held  in  his  latest  set.  A  well-designed  system,  with  large  on-line 
storage  facilities  and  a  good  response  time  between  the  user’s  terminal  and  the  information  store,  will  allow  a  searcher  to 
interrupt  and  redirect  his  search  at  any  stage,  and  will  display  selected  bibliographic  references  as  required.  It  is  valuable 
also  to  save  time  and  resources  by  providing  a  choice  of  display  formats  which  can  be  selected  by  the  use  of  a  parameter 
code  keyed  in  by  the  user  when  he  gives  the  print  or  display  command  at  the  conclusion  of  the  retrieval  phase  of  the 
search.  The  three  optional  formats  generally  offered  are  (a)  accession  numbers  only,  (b)  bibliographic  information 
without  abstracts  but  including  the  descriptor  field,  and  (c)  complete  records  inclusive  of  abstracts. 

Once  the  inverted  file  system  has  been  set  up,  and  provided  that  there  is  sufficient  on-line  storage  capacity  available, 
it  becomes  feasible  to  extend  the  information  retrieval  capability  to  cover  data  fields  in  the  record  other  than  the 
descriptor  field.  Authors,  for  instance,  can  be  added  to  the  Indexing  Term  field  by  software  and  subjected  to  straight¬ 
forward  matching  logic  for  retreival  purposes.  Also,  depending  upon  the  resources  available,  features  can  be  built  into  the 
program  which  will  help  the  searcher  to  deal  with  mis-spelled  keywords  and  elementary  keying  errors. 

The  Title  field  can  also  be  automatically  extracted  from  the  main  bibliographic  record,  stripped  of  words  of  little 
contextual  significance,  and  set  up  in  an  inverted  file  for  information  retrieval  purposes.  Existing  software  techniques 
can  then  be  applied  to  allow  the  user  who  wishes  to  base  his  search  upon  two  significant  terms  to  either  retrieve  all  report 
references  which  contain  the  two  selected  terms  in  their  titles  or,  in  more  sophisticated  systems,  to  retrieve  only  those 
references  whose  titles  contain  the  two  chosen  terms  in  proximity  to  one  another,  say  up  to  seven  words  apart. 

Database  systems  which  provide  the  users  with  abstracts  can,  where  resources  permit,  have  their  retrieval  systems 
extended  to  apply  this  proximity  logic  to  all  searchable  fields  including  the  abstract  field.  As  the  technology  de /clops 
and  data  storage  oecomes  cheaper  it  becomes  increasingly  feasible  to  hold  the  text  of  complete  reports  in  the  computer. 
Proximity  logic  is  a  valuable  tool  for  information  retrieval  in  full-text  systems. 

The  information  retrieval  files  so  far  described  deal  with  complete  descriptors,  which  can  be  multi-word  (with  or 
without  hyphens)  and/or  alpha-numeric,  provided  that  they  meet  the  requirement  that  the  total  number  of  characters, 
including  spaces,  within  a  single  descriptor  must  not  exceed  the  maximum  which  the  system  is  designed  to  handle.  The 
machine-readable  thesaurus  file  should  be  accessible  by  the  user  throughout  his  search  and,  if  he  wishes  to  refer  to  the 
thesaurus  at  any  time,  he  must  be  able  to  resume  his  search  at  the  point  at  which  it  was  interrupted.  The  thesaurus  file 
is  an  example  of  a  structured  file,  i.e.  each  descriptor  appears  as  a  main  term  (see  Figure  6)  followed  by  its  various  related 
terms  which  are  held  in  their  appropriate  classes  (broader  terms,  narrower  terms,  etc).  A  sub-set  of  the  thesaurus  file  is 
maintained  which  simply  lists  the  main  terms  in  alphabetical  order,  and  this  file  also  should  be  available  to  the  user  when 
he  is  carrying  out  his  search. 

As  well  as  handling  complete  descriptors,  the  system  should  permit  the  user  to  employ  truncated  terms.  A  control 
code  which  can  be  printed  but  which  does  not  appear  in  normal  text  can  be  selected  to  control  this  function  and, 
assuming  that  the  asterisk  is  used  for  the  purpose,  appropriate  software  will  interpret  ‘COMPUT*’  as  an  instruction  calling 
for  all  terms  in  the  alphabetical  list  which  begin  with  the  six  characters  ‘COMPUT’  to  be  displayed.  This  facility  is  known 
as  ‘‘back-end  truncation”  and  it  provides  the  searcher  with  a  very  useful  retrieval  tool.  “Front-end  truncation”  is  not  so 
straightforward,  but  it  can  be  achieved  by  setting  up  a  special  version  of  the  alphbetically-arranged  file  of  main  terms  in 
which  each  descriptor  is  reversed.  The  user’s  search  terns  are  similarly  reversed  before  the  character  comparison  logic  is 
applied,  so  that  ‘‘NETWORKS’  keyed  in  at  the  search  terminal  can  be  interpreted  by  the  software  as  an  instruction  to 
display  a  list  of  all  the  descriptors  in  the  thesaurus  which  end  with  the  characters  ‘NETWORKS’. 

The  fact  that  the  total  number  of  postings  is  stored  with  each  descriptor  in  file  inverted  file  enables  the  user  to 
display  the  number  of  ‘hits’  resulting  from  a  search  at  any  stage  of  his  retrieval  activity.  The  user  will  terminate  his  search 
when  he  achieves  a  reasonable  number  of  'hits’  of  the  required  quality  and,  having  displayed  n  few  at  the  terminal  for 
reference  purposes,  he  should  be  able  to  print  out  all  his  search  results  off-line  at  the  computer.  It  is  helpful  if  the 
searcher  can  react  with  the  retrieval  program  in  cn  interactive  (conversational)  mode  so  that,  for  instance,  he  can  be 
helped  from  the  start  to  state  his  search  parameters  correctly.  Towards  the  end  he  should  be  offered  the  option  of  the 
various  print  or  display  formats  described  and  finally,  live  choice  of  carrying  out  another  search  or  logging  off.  The  total 
retrieval  activity  should  be  protected  by  a  system  of  user  passwords,  and  the  final  'log-otT  instruction  should  initiate  a 
display  on  the  VDU  screen  of  the  time  spent  in  the  various  files  and.  where  applicable,  the  cost  incurred  by  the  user  for 
his  search. 

4.12  Kcy-to-disc  input  from  Free-standing  Tcrinina’  Stations 

One  of  the  principal  applications  of  computers  within  the  business  office  environment  has  come  to  be  known  as 
‘word  processing’,  a  term  which  is  generally  applied  to  the  printing  of  correspondence  and  internal  reports  by  computer- 
based  procedures.  Manufacturers  of  word  processing  equipment  have  tended  to  build  free-standing  ‘intelligent’  terminals 
incorporating  a  VDU  keyboard,  a  slow  printing  device  capable  of  producing  good  quality  characters  in  upper  and  lower 


case,  an  expandable  memory  and  an  exchangeable  dibv.  unit  which  us  generally  eased  upon  a  ‘floppy '  disc  (diskette).  As 
an  alternative  to  connecting  standard  VDU's  into  an  in  house  computer,  it  is  feasible  to  create  ST1  input  on  word 
processing  terminals  and  to  subsequently  transfer  the  data  set  up  on  the  exchangeable  discs  to  the  larger  discs  of  an  in 
house  machine. 

4.13  Key-to-tape  Input  Systems 

A  third  possibility  is  the  preparation  of  STI  input  by  the  use  of  shared  logic  systems.  These  generally  consist  of 
relatively  simple  VDU  keyboard  terminals  connected  by  cable  to  a  central  magnetic  tape  unit  which  incorporates  the  soft 
ware  which  provides  'prompts’  to  the  operators  and  carries  out  basic  validation.  Where  more  than,  say,  six  terminals  are 
attached  to  a  single  tape  unit,  it  is  usual  to  incorporate  a  supervisor's  VDU  from  which  checks  can  be  earned  out  on  the 
quality  and  quantity  of  the  input  generated  by  the  individual  operators.  Problems  can  arise  however,  due  to  the  fact  that 
magnetic  tape  files  are  essentially  sequential.  It  is  difficult  therefore  to  make  the  proof  reading  sequence  different  from 
the  original  input  sequence,  and  the  selection  of  individual  report  references  from  the  tape  for  display  purposes  becomes  a 
slow  process.  The  output  magnetic  tapes  however,  can  be  handled  by  standard  software  and  hardware  and  each  tape  has 
many  times  the  capacity  of  a  floppy  disc. 

4.14  Input  from  External  Sources 

There  are  many  advantages  to  be  gained  if  individual  agencies  which  process  STI  by  computer  make  direct  contact, 
both  loc.dly  and  on  a  world  wide  basis,  with  other  organisations  carry  mg  out  similar  activities.  It  is  generally  found  that 
individual  organisations  will  adopt  internal  file  formats  which  will  meet  local  requirements  and  each  can  be  expected  to 
build  its  own  data  base.  The  most  costly  part  of  any  comprehensive  STI  system,  in  terms  of  both  time  and  money ,  is 
the  data  preparation  and,  for  organisations  which  have  areas  of  subject  interest  in  common,  effective  economies  can  result 
from  the  exchange  of  magnetic  tapes  containing  ready  formatted  bibliographic  information.  Standards  have  been  laid 
down  for  t.ie  fo.mats  of  exchange  tapes,  but  these  will  generally  be  found  to  contain  optional  fields  which  require  precise 
definition  before  they  can  be  accepted  by  other  than  the  originator's  software.  Thus  it  is  advisable  to  enter  into  direct 
negotiations  with  any  organisation  which  offers  magnetic  tapes  on  an  exchange  basis  to  ensure  compatibility  of  software 
for  data  interpretation  and  conversion. 

Two  typical  bibliographic  file  layouts  are  compared  in  Figure  20  where  it  can  be  seen  that  the  comersion  program 
will  have  to  change  the  sequence  of  the  fields  in  the  originator’s  tape  and  omit  some  of  the  data  completely .  Hie  soft 
ware  must  also  be  designed  to  cope  with  special  characters  which  may  or  may  not  be  printable  by  the  recipient's 
computer,  and  the  allocation  of  control  codes  within  the  originator’s  system  must  also  be  carefully  considered. 

4.15  Direct  Printing  via  the  Computer 

t  he  complete  STI  system  winch  is  being  described  will  luwo  an  important  publishing  function,  which  means  that  the 
convenience,  speed  and  quality  of  printing  becomes  very  significant.  The  most  vital  aspect  of  the  various  printing 
operations  is  the  creation  of  the  masters  which  will  provide  the  pages  of  announcement  journals,  but  there  arc  many  less 
significant  printing  jobs  which  can  be  satisfactorily  carried  out  on  faster  machines  which  may  give  less  perfect  outputs. 

So  mue  of  the  printed  material  is  in  the  form  of  text  intended  for  discerning  readers  in  a  library  type  environment  that 
output  in  upper  and  lower  vise  has  come  to  be  regarded  as  normal.  Fins  generally  means  that  the  printing  peripherals 
will  be  operating  at  less  than  their  Maximum  design  speeds,  and  this  l.ictoi  must  be  allowed  for  at  the  system  design  stage. 
Decisions  concerning  the  printers  to  suit  a  specific  STI  system  will  depend  upon  local  requirements  such  as  the  volume 
of  text  to  be  handled,  the  standards  of  quality  which  have  been  established  and  the  time  constraints  with  have  lo  be 
applied  at  the  various  stages  of  production.  A  very  large  system  producing  printed  output  for  subject  searches,  current 
awareness  runs  and  regular  proof  reading  prints  might  justify  the  exclusive  use  of  an  expensive  high  speed  drum  printer, 
bn.  eicatcr  cost  efficiency  can  often  be  achieved  by  sharing  the  printing  load  between  two  or  three  smaller,  slower  pen 
pherals.  Typical  printers  suitable  for  oi  l  purposes  arc  shown  in  figures  23  to  2o  but  bearing  in  mind  the  late  at  which 
technological  developments  are  taking  place,  it  is  advisable  to  look  closely  at  the  current  printer  market  before  making  a 
final  ehoii'*'. 


4.16  Printers 

Most  of  the  printing  devices  offered  by  manufacturers  provide  ten  characters  per  inch  Upi)  across  the  page  and  print 
six  lines  to  the  inch,  although  twelve  epi  and  eight  lines  per  inch  can  sometimes  be  provided.  The  range  of  characters  and 
the  style  of  typo  face  to  be  used  in  any  specific  application  must  be  established  at  an  early  stage  of  the  design  of  the 
system,  allowing  for  the  fact  trial  the  visual  representation  of  control  codes,  such  as  field  and  record  separators,  will  be 
required  from  time  to  time.  Sets  of  64  diffciciU  characters  arc  usually  offered  for  text  applications  m  which  no  lovvci 
case  alphabet  will  be  printed,  and  %  character  sets  ate  recommended  wherever  output  is  required  in  both  upper  and 
lower  case.  Two  typical  Q(i  character  sets  are  shown  m  Figures  21  and  22  and  il  can  be  seen  that  symbols  are  available  to 
represent  ,<  wide  range  of  control  codes  and  other  characters  v  hich  arc  normally  ''non  print”.  Printing dcv.vCs  suitable 
for  STI  applications  can  be  grouped  into  the  feat  main  categories  which  are  described  briefly  below,  aldtough  it  van  be 
expected  that  individual  manufacturers  will  offer  special  features  within  these  broad  classes. 
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4.16.1  Tin  Drum  Printer  (Fig. 23)  is  the  fastest  kind  of  printing  peripheral  in  gcnct.il  use,  with  many  manufacturers 
offering  speeds  of  about  2000  lines  per  minute.  Chara.tcrs  arc  embossed  upon  the  surface  of  the  rotating  drum  and  fused 
hammers,  one  for  each  character  position  .dong  the  length  of  the  drum,  force  the  paper  instantaneously  onto  the  faces 
of  the  selected  characters.  This  sharp,  repeated  impact,  applied  \ia  the  inked  ribbon,  creates  lines  of  characters  on  the 
printed  page,  so  that  the  device  is  also  known  as  a  'line  printer’. 

4  16.2  The  Matrix  Printer  (Figures  22  and  24)  create  haraders  from  a  matrix  of  dots.  Manufacturers  offer  ingenious 
ways  of  minimising  the  spaces  between  dots  to  give  the  impression  of  a  continuous  ink  line,  and  it  follows  that  the  closer 
the  proximity  of  the  dots  (dots  pet  inch)  within  the  matrix,  the  denser  and  more  attractive  will  be  the  characters  produceu. 
A  minimum  of  seven  dots  is  recommended  across  the  face  of  cadi  character,  and  a  minimum  of  nine  dots  is  required  for 
each  character  vertically  to  allow  for  the  production  of  true  descenders,  ic  the  stems  of  ‘g\  *j\  *p\  etc.  The  characters, 
which  are  provided  in  the  usual  sets,  are  created  by  means  of  a  sequence  of  impulses  generated  within  the  machine 
(hard  wired  logic)  and  these  ..re  converted  into  patterns  of  dots  on  the  printed  page  according  to  any  one  of  a  variety  of 
mechanical  processes.  The  increasing  availability  of  micro  processors  makes  ii  much  more  feasible  to  inject  special 
characters  into  the  printing  sequence  under  the  control  of  the  user's  print  program.  Matrix  printers  are  generally  slower 
and  less  expensive  than  drum  printers,  providing  good  quality  output  at  speeds  between  300  and  oOO  lines  per  minute. 

4.16J  The  'Dai*}  Wheel  Printer  (Fig.25)  produces  output  of  very  good  quality  at  speeds  around  50  characters  per 
second,  approximating  to  30  lines  pet  minute,  A  repertoire  of  88  characters  is  generally  provided  for  upper  and  lower 
.  nee  applications,  with  the  characters  embossed  at  the  ends  of  ‘petals'  formed  by  means  of  radial  cuts  in  a  flat  disc  about 
75  mm  in  diameter.  As  the  disc  rotates  the  selected  characters  arc  pressed  onto  the  paper,  through  the  inked  ribbon,  to 
create  lines  of  print.  Printers  of  this  kind  have  very  few  moving  parts  and  are  the  least  expensive  to  buy ,  and  they  have 
the  advantage  of  providing  a  variety  of  type  faces  according  to  the  disc  selected. 


87 


4.16.4  The  Electrostatic  Printer  (Fig.26)  is  relatively  expensive,  but  it  provides  output  of  the  best  quality.  The  three 
kinds  of  printer  described  so  far  can  give  multiple  copies  at  a  single  pass  by  using  continuous  stationery  inter-leaved  with 
carbon  or  with  paper  especially  treated  to  provide  multi-part  prints  as  a  result  of  an  impact  process  but  without  carbons. 
The  electrostatic  printer  however,  provides  single  copies  only  on  paper  which  is  sensitised  to  record  electrical  intpulses. 
Each  character  is  created  initially  as  a  pattern  of  electrically  charged  dots  and  the  paper  is  then  brought  into  contact  with 
magnetic  ink  which  adheres  only  to  the  charged  areas.  The  ink  is  dried  instantaneously  and  a  permanent  record  is 
retained  on  the  paper.  Hard-wired  logic  boards  provide  a  range  of  character  sets,  and  the  process  allows  for  the  dots  to  be 
more  closely  packed  than  is  possible  with  matrices  which  are  created  mechanically.  Characters  appear  to  be  more 
intensely  black  and  are  thus  mors  attractive,  and  economies  in  the  use  of  paper  can  be  achieved  in  some  instances  by 
operating  at  more  than  twelve  characters  per  inch.  Using  normal  characters  electrostatic  machines  can  provide  1000  lines 
of  print  per  minute, "but  the  process  is  closely  allied  to  graph  plotting  so  that  it  can  give  an  almost  limitless  range  of 
characters  and  character  sizes  at  slower  speeds  in  response  to  appropriate  logic  in  the  user’s  program.  Some 
manufacturers  provide  sets  of  cyrillic  characters  which  are  generated  from  logic  boards  plugged  into  the  machine. 

4.1 7  The  Remote  Printing  of  Computer  Output 

It  has  been  assumed  so  far  that  the  printing  devices  for  the  production  of  computer  output  in  the  form  of  paper- 
copies  (hardcopy)  are  linked  directly  to  the  processor.  The  buffers  into  which  the  data  are  fed  from  the  processor  may 
only  hold  one  or  two  lines  of  characters  at  a  time,  so  the  speed  of  operation  becomes  very  slow  when  compared  with  the 
data  transfer  rates  of  magnetic  media  (tapes  or  discs)  or  even  of  punched  paper  tape  in  some  instances.  Where  the 
demand  for  printed  output  justifies  it,  greater  overall  efficiency  can  be  achieved  by  the  use  of  remote  printing  devices, 
and  the  introduction  of  such  ‘off-line’  resources  opens  the  way  to  the  consideration  of  computer-controlled  photo¬ 
typesetting  and  a  standard  of  page  presentation  which  can  compare  favourably  with  that  of  a  well-printed  book.  If  it  is 
decided,  in  the  course  of  the  design  of  an  ST1  system,  that  printing  will  take  place  off-line,  then  the  end  product  of  the 
data  processing  stage  of  the  operation  will  be  in  the  form  of  punched  paper-tape,  magnetic  tape  or  floppy  disc,  each 
especially  formatted  to  meet  the  requirements  of  the  selected  printing  device. 

4.17.1  Off-line  printing  can  be  used  in  association  with  any  of  the  printers  already  described  and  is  recommended  as  an 
efficient  way  of  dealing  with  long  and  uninterrupted  print  jobs.  In  a  time-sharing  computer  system,  i.e.  one  in  which 
several  programs  can  be  run  simultaneously,  the  print  program  for  the  production  of,  say,  the  pages  of  the  announcement 
journal  can  be  run  as  a  background  job  very  economically,  but  a  good  quality  printer  must  be  dedicated  to  that  job 
throughout  the  run  A  magnetic  tape  set  up  for  printing  in  this  way  is  valuable  for  archival  purposes  and  will  also  provide 
the  input  for  any  other  incremental  tape  which  may  be  required,  c.g.  the  database  update  tape  or  an  information  exchange 
tape. 

4. 1 7.2  Computer-controlled  photo-typesetting  provides  printout  of  a  very  high  quality  but  involves  rather  costly  equip¬ 
ment.  As  the  name  implies  the  basic  output  of  the  photo-typesetter  is  an  image,  generally  a  page  or  column  of  text, 
created  on  photo-sensitive  paper  by  exposing  it  under  strictly  controlled  conditions  to  light  or  an  electron  beam  which 
has  passed  through  a  translucent  medium  on  which  characters  are  inscribed.  Some  modern  devices  use  electronic 
character  generation  techniques  with  a  cathode  ray  tube  display.  Type  sizes  can  be  varied  by  means  of  lenses,  and  a  single 
character  grid  or  a  specific  set  of  character  generation  logic  is  generally  of  a  sufficient  capacity  to  accommodate  several 
related  alphabets,  thus  providing  a  number  of  different  type  faces.  The  device  responds  to  two  data  streams,  the  first  of 
which  selects  oi  creates  the  characters,  leaving  tiie  second  to  place  them  in  their  proper  sequence  on  the  display  tube  for 
subsequent  exposure,  line  by  line,  onto  photosensitive  paper  or  film.  A  Linotron  505  photo-typesetter  was  used  to  create 
the  page  of  text  shown  in  Figure  27  operating  upon  data  supplied  on  magnetic  tape.  Figure  28  illustrates  the  operating 
principles  of  the  Linotron  202  machine,  and  typesetters  of  this  kind  provide  film  output  of  a  size  which  is  directly 
commensurate  with  the  normal  printed  page.  Other  designs  of  photo-typesetter  produce  microfilm  output  (computer 
output  microfilm  or  ’COM’  see  Section  9,  Vol.lll  of  the  Manual)  which  must  be  enlarged  photographically  when  printed 
pages  are  required.  Logic  can  be  devised  to  enable  the  larger  machines  to  carry  out  automatic  hyphenation  of  words  at 
the  end  of  lines  of  text  and  to  create  a  uniform  line  length  by  varying  the  spaces  between  words  and  characters,  a  feature 
known  as  right-hand  justification.  Simpler  and  slower  type-setters  use  character  sets  inscribed  upon  continuous  bands  of 
film,  several  of  which  can  in  some  instances  be  accommodated  simultaneously  in  the  machine  to  give  a  range  of  type 
faces.  Depending  upon  the  type  faces  used  and  any  special  format  features  involved,  it  can  take  just  a  few  minutes  to 
create  a  typset  page.  Programming  for  computer-based  photo-typesetting  requires  special  skills,  since  the  techniques 
involved  are  expressed  in  terms  familiar  to  the  printing  industry,  and  the  output  requires  expert  line-by-line  analysis  in 
order  to  lay  down  the  software  logic  rules  which  will  govern  the  format  to  be  adopted  for  the  display  of  the  data.  The 
process  can  employ  type  faces  which  are  both  small  and  clear,  and  can  space  the  characters  proportionately  in  the  line, 
thus  almost  doubling  the  quantity  oi  text  on  a  page  as  compared  with  the  output  from  conventional  computer  printers. 

4.1 8  The  Layout  cf  Printed  Pages 


In  order  that  the  maximum  benefit  may  be  obtained  from  an  ST1  system  it  is  essential  that  the  best  quality  output 
shall  be  that  which  is  read,  either  on  a  VDU  screen  or  in  a  printed  publication,  by  the  user.  Extensive  work  has  b-an 
carried  out  by  the  UK  Royal  College  of  Art  on  the  readability  of  print,  and  the  appearance  of  the  printed  page,  both  f'om 
the  practical  and  the  aesthetic  points  of  view,  is  of  great  significance  to  the  user  of  the  system  when  he  assesses  the 
acceptability  of  its  products.  The  means  available  for  creating  well  laid-out  pages  are  therefore  described  in  detail  in  the 


Vol,  30  No.  1 


88 

Field  9  ELECTRONICS  AND  ELECTRICAL  ENGINEERING 
Group  9A  Components 


Anplications.Toulouse,l972.France 

VERY  HIGH  VOLTAGE  POWER  SUPPLY  (2500  V.  500  \Y) 

(Translated  trom  French) 

ALIMENTATION  TRES  HAUTE  TENSION  I)E  PUISSANCE: 
2500  V,  500  W 

l.cmoine.J.D.  5. 1974  Upp 

Availability:  TRC  £0.70 

Describes  a  high  voltage  power  supply  (2500  V.  500  W) 
suitable  for  an  ion  thruster.  The  general  considerations  of  the 
objectives  of  the  study  are  followed  by  a  description  of  the 
circuits  and  the  results  obtained  on  a  breadboard  1098-3001 

Indexing  terms,  *  Power  supply  circuits/*  High  voltage/Mon 
engines/Ion  thrusters//09a / 


9H  Computers 

174-0428.3  NPL-REP-NAC-48 
National  Physical  Lab.,Teddington.Mddx.,UK 
WRITING  ALGORITHMS  IN  ALGOL  60 
Hill, I. D.  Scowtu.R.S.  4.1974  2lpp  I5ref 
Availability:  TRC  £0.50 

Discusses  the  difficulties  of  writing  procedures  and  programs 
in  ALGOL  60  for  general  use.  It  also  shows  how  the 
problems  can  be  alleviated  and  overcome.  1099-3001 

Indexing  terms-  ’Computer  programs/* Algol/*  Algorithms// 
09b/62b? 


T74-04962  MAC-TR-123 
Massachusetts  Inst.of  Tech. .Cambridge. USA 
INTRODUCTION  TO  MULTICS 
Saltzer.J.H.  2.1974  213pp58ref 

N-000I4-70-A-0362-0006 
Availability:  TRC  £5.40 

The  report  is  an  introduction  to  the  properties,  concepts,  and 
usage  of  the  Multics  (MUI.Tiple.xed  lnformaton  md 
Computing  Service)  system.  Its  four  chapters  are  designed  for 
reading  continuity  rather  then  for  reference  or  completeness. 
Chapter  1  provides  a  broad  overview  Chapter  2  goes  into  the 
concepts  underlying  Multics.  Chapter  3  is  a  tutorial  guide  to 
the  mechanics  of  using  the  system,  with  illustrative  examples 
of  terminal  sessions.  Chapter  4  provides  a  series  of  examples 
of  programming  in  the  Multics  enviionment.  1 100-3001 

Indexing  terms:  'Computers/*  Data  storage/*  lnfomation 
retricvnl/'Time  sharing/Multics/Pr.iject  MAC/Multiplcxing// 
09b/ 


T74-04963 

Arizona  Univ., Engineering  Experiment  Station.Tuc.son.USA 
INTERACTIVE  GRAPHICS  SOFTWARE  FOR  USE  IN 
FINITE  ELEMENT  ANALYSIS  (ANNUAL  REPORT) 
Kamel.H.A.  McCabe, M.W.  2  1974  I76pp  30rcf 
N-00014-67-A-0209-0916 
Availability;  TRC  £4.40 

Includes  a  summary  of  activities  during  the  1973  calender 
year,  as  well  as  a  number  of  appendices  related  to  the 
development  of  the  GIFTS  (Graphic  oriented  Interactive 
finite  element  analysis  on  Time  Sharing)  .system  during  this 
period.  Appendix  A  contains  a  paper  outlining  the  current 
GUT'S  system.  This  paper  has  been  submitted  fi  t 
presentation  to  the  CNR  International  Symposium  on 
Structural  Mechanics  Soflwate.  Maryland,  June  1974.  One  of 
the  test  cases  chosen  for  the  system  was  a  two-dimensional 
analysis  of  a  human  bone  in  support  of  a  theory  linking  bone 
stresses  to  arthritis  of  the  joints.  This  paper  is  included  as 
Appendix  11.  A  revised  and  updated  user's  manual  and  the 
newest  format  for  the  Unified  Data  Rase  are  included  as 
appendices.  A  new  item  is  a  solved  example  manual  containing 
two  problems  of  varying  complexity.  Additional  appendices 
pertaining  to  the  improvement  of  solution  efficiency  are  also 
included.  1101-3001 
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174-04998  REP-SF-2 
Colorado  Slate  Univ  .Fort  Collins.USA 
ELECTRONICS  OF  COMPOUND  SEM1CONDICTOR 
INTERFACES 

SitesJ.R.  Wilmsen.CAV,  3.1974  8Spp 
Availability:  TRC  £2.50 

Contains  the  summaries  of  the  formal  presentations  made  at 
the  Electronics  of  Compound  Semiconductor  Interfaces 
workshop  conference  held  February  5-6,  1974.  at  Colorado 
State  Univetsity.  The  purpose  of  the  workshop  conference 
was  to  learn  what  progress  is  being  made  in  understanding  the 
boundaries  of  compound  semiconductors  and  the  significant 
effects  these  interfaces  have  on  the  operation,  reliability,  and 
stability  of  electronic  devices.  Various  participants  reported  on 
the  behaviour  of  free  semiconductor  surfaces,  semiconductor- 
metal,  semiconductor-insulator,  and  semi-conductor- 
semiconductor  interfaces.  For  the  most  part  the  discussions 
dealt  with  compourd  semiconductors  with  particular  emphasis 
on  III- V  materials.  1102-300! 
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174-05314  ARRE-R-7587 
Atomic  Energy  Res.Est.,Harvvcll,UK 
DESIGN  OK  HARDWARE  SIMULATORS  FOR  COMPUTER 
INPUT/OUTPUT  CHANNELS 
Purehes.M.J.  2.1974  22pp  9ref 

Availability:  HMSO  £0.50 

Difficulties  in  implementation  of  new  designs  of  computer 
peripherals  can  be  cased  significantly  by  employing  hardware 
computer  simulators  for  many  of  the  tests.  The  report 
identifies  some  situations  where  simulators  can  be  gainfully 
employed;  makes  general  design  suggestions;  and  describes  in 
detail  two  simulators  in  regular  use  at  Harwell.  1 103-3001 
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Joint  Nuclear  Res  Centre. lspra  Est., Italy 

REAL  TIME  OPERATING  SYYSTEM  FOR  SMAR  T  DATA 

TERMINALS 

Edcr.J.  HammansAV.  5,1974  48pp 
Availability:  TRC  £150 

The  need  for  an  effective  communication  link  between  various 
digital  data  sources  scattered  around  a  big  laboratiry 
environment  and  a  commonly  shared  data  logging  system 
(SPO-16  minicomputer  with  disc  and  magtape)  led  to  the 
development  of  SMART  datastntious  (1).  SMART 
datastations  are  active,  they  can  take  over  partially  control  of 
the  comp' ' The  terminals'  user  has  real  time  access  ami 
may  inf  heme  the  data  flow  in  the  computer  A  software 
package  •.  described  in  this  report  which  allows  to  control 
several  simultaneously  working  terminals  in  a  disc  based 
operating  system  lDDOS-16).  A  control  routine  recognises  a 
set  ol  control  commands  from  the  terminals  and  stores 
incoming  data  due  to  the  respective  station  address  on  disc. 
Some  off-line  routines  are  provided  for  file  assignment,  code 
conversion  and  editing  of  IBM  compatible  magnetic  tapes. 
1104-3001 
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Gemcinsame  Kernforschungsstcllc  l  orschijugsaustalt'lspra, 
Italy 

INTERFACE  FOR  THE  CONTRO  OF  A  MULTICHOPPER 
FACILITY  (Report  in  German) 

INTERFACE  7. UR  STKURRUNG  FINER 
MULTICHOPPKRANLAGK 
EderJ.  llr.ik.IL  5  1974  22pp  2ref 
A  \  adability :  TRC'  £1.00 

An  interface  for  the  coupling  of  a  multi-chopper  facility  to  a 
computer  has  been  developed,  which  provides  pioeram 
controlled  settings  of  rotor  speed  (period  time)  and  ph.oe 


Fig.27  Computer  typeset  abstract  journal 
A  sample  page. 


Fig.28  The  Linotron  202  computer-controlled  photo-typesetter 
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ensuing  paragraphs,  for  there  are  features  which  can  be  incorporated  into  the  printing  procedures  to  give  output  formats 
which  are  both  effective  and  pleasing  to  the  eye  as  well  as  reducing  fatigue  in  the  reading  of  the  text.  It  is  usual  to  have 
a  relatively  slow  printing  device  available  for  high-quality  work,  but  allowance  must  also  be  made  for  those  parts  of  the 
system  which  produce  programs  listings,  operating  statistics  and  similar  routine  printed  output  for  use  within  the 
computer  area  only.  Such  work  demands  speed  and  efficiency  rather  than  quality  of  print,  provided  that  it  is  legible. 

4.18.1  Announcement  Journals  and  Accession  Lists  require  special  care,  and  there  are  four  important  features  which 
combine  to  create  a  good  page  format.  If  the  pages  are  arranged  according  to  subject  and  the  main  subject  headings  are  in 
alphabetical  order  throughout  the  publication,  each  left-hand  page  of  text  should  display  at  the  top,  printed  bold,  the  main 
subject  heading  assigned  to  the  first  item  of  bibliographic  information  on  that  page,  and  each  right-hand  page  should 
display  the  heading  assigned  to  the  last  item  on  that  page,  liven  if  there  are  items  belonging  under  several  headings  on  a 
pair  of  facing  pages,  it  is  sufficient  to  display  only  the  first  and  the  last.  If  adjacent  pages  contain  only  items  belonging  to 
the  same  main  subject  group,  then  the  page  heading  should  be  followed  by  the  word  ‘(continued)’,  bracketed  as  shown, 
wherever  it  applies.  Secondly,  the  accession  number  which  uniquely  identifies  each  item  in  the  publication  should  stand 
out  boldly.  Thirdly,  there  should  be  a  sufficient  allowance  of  ‘light’,  i.e.  space  between  words  and  between  lines  on  each 
page,  without  wasting  paper  and,  finally,  no  item  should  start  too  near  the  end  of  a  column  or  page  of  text.  The  impres¬ 
sion  of  hold  type  can  be  created  on  some  printers  by  holding  a  line  of  text  in  a  buffer  and  repeating  the  line  by  using  the 
‘Carriage  Return’  code  without  a  ‘Line  Feed’.  Otherwise  upper  case  characters  used  in  conjunction  with  careful  spacing 
can  be  quite  effective.  Fourthly,  the  print  program  should  be  designed  to  look  ahead  to  avoid  starting  any  items  too  near 
the  bottom  of  a  column  or  page  of  text.  A  useful  standard  is  the  ‘five-line  rule’  whereby  there  must  be  room  to  accom¬ 
modate  at  least  five  lines  of  text  before  a  fresh  item  is  started  at  the  end  of  a  page.  The  routines  set  up  for  printing  main 
subject  headings  and  sub-headings  must  also  be  taken  into  account  when  programming  to  incorporate  this  rule.  It  would 
be  advantageous  to  hold  the  accession  number  belonging  to  each  item  in  store  throughout  the  printing  of  the  whole  item. 
Then,  should  it  prove  necessary  to  continue  the  item  on  a  second  page,  the  heading  information  on  the  new  page  could 

be  followed  by  a  repeat  of  the  accession  number  followed  by  the  word  ‘(continued)’  placed  after  it  in  brackets  as  shown. 

It  is  usual  to  hold  the  main  subject  headings  and  the  sub-headings  in  both  coded  form  and  in  text  in  a  look-up  list 
which  resides  in  the  computer’s  data  store  when  the  printing  program  is  being  run.  The  first  data  field  in  each  record 
should  be  the  main  subject  code  which  is  matched  by  program  against  its  counterpart  in  the  look-up  list.  This  matching 
operation  is  then  used  as  a  trigger  to  copy  the  complete  heading  from  the  list  into  its  proper  position  on  the  printed 
page,  thus  basing  this  part  of  the  system  upon  an  authority  liU  which  can  be  readily  updated  when  additional  scientific 
and  technological  disciplines  have  to  be  accommodated.  Page  numbers  can  also  be  held  in  a  look-up  list  controlled  by 
program  logic  which  will  place  the  odd  numbers  in  the  top  right-hand  corner  of  right-handed  pages  and  the  even  numbers 
in  the  top  left-hand  corner  of  left-handed  pages.  Page  I  of  a  publication  should  always  be  a  right-hand  page. 

4.15.2  Photo-reduction.  It  is  quite  feasible  to  create  two  or  three-column  page  structures  (two-up  or  three-up  pages) 
when  printing  by  <'°mputer-controlled  photo-typesetter,  but  it  is  wise  to  use  only  single  column  pages  in  systems  which 
employ  orthodox  printers.  Assuming  standard  A4  sizes  with  a  page  width  of  8’/j  inches  (210  mm),  the  text  can  occupy 
up  to  7  inches  (178  mm)  of  lie  length  to  give  a  binding  margin  of  7/8  inch  (22  mm)  and  a  free  margin  of  3/8  inch 

(10  mm)  along  the  opposite  edge.  Where  no  photo-reduction  facilities  are  available  it  may  be  necessary  to  base  the  page 
structure  on  a  70  character  line  (7  inches  x  10  characters  to  the  inch),  jut  it  can  be  more  satisfactory  and  economical  to 
allow  the  full  80  character  line,  which  is  generally  available  on  the  VDU  screens  used  at  the  data  prep  stage,  to  form  the 
basis  of  the  printed  output.  There  is  a  wide  range  of  equipment  available  on  the  commercial  market  from  companies  such 
as  3M’s  and  Kodak  which  will  provide  variable  photo-reduction  ratios,  some  offering  a  range  of  sizes  from  50%  reduction 
to  50%  enlargement  of  the  original.  The  facility  for  reducing  original  sizes  has  been  proved  to  be  the  more  useful,  and 
well-produced  10  character  to  the  inch  computer  printer  output  can  be  quite  legible  down  to  about  60%  of  the  original 
character  size,  i.e.  with  the  camera  set  to  enable  a  line  of  1 20  characters  occupying  1 2.0  inches  (304  mm)  to  fit  into  about 
7  inches  ( 1 78  mm).  As  indicated  in  paragraph  4. 1 8.3  this  degree  of  photo-reduction  can  prove  particularly  valuable  in  the 
production  of  indexes  to  announcement  journals  since  these  are  generally  browsed  through,  whereas  the  main  pages  of 
text  are  more  carefully  read  and  therefore  require  the  best  possible  quality  of  print. 

If  it  is  decid  a  to  produce  the  master  pages  of  text  for  publications  by  photo-reducing  original  output,  then  the 
computer  printer  will  be  expected  to  create  especially  good  camera-ready  copy.  The  1 2Vi%  reduction  required  to  place 
80  characters  in  a  7  inch  (178  mm)  line  length  will  be  automatically  applied  to  the  length  of  the  page  as  well,  but  it  is 
generally  considerations  of  width  which  govern  page  sizes,  since  it  is  relatively  easy  to  adjust  the  line-counts  in  a  print 
program.  There  must  be  an  allowance  oi  space  at  the  top  and  bottom  of  each  page,  so,  assuming  six  lines  to  the  inch  on 
the  original  printer  output  and  a  final  page  length  of  1 1%  inches  (29"?  mm),  each  page  can  be  allowed  to  contain  a 
maximum  of  65  lines.  This  figure  is  estaolished  by  the  fact  'hat  the  continuous  stationery  for  computer  printers  normally 
measures  1 1  inches  (278  mm)  between  transverse  perforations  and,  on  the  photo-reduced  page,  64  lines  can  give  a  1  inch 
(25  mm)  clear  margin  at  the  top  and  1  ‘A  inch  (32  mm)  clear  at  the  bottom.  The  positioning  of  the  page  ne.jbers  and  the 
allocation  of  blank  lines  to  give  light  to  a  page  are  important  features  in  the  design  of  good  printing  formats. 


4.1  S.S  Printed  indexes  do  not  generally  require  the  same  high  standards  of  printing  and  page  layout  as  are  necessary  for 
the  main  text.  The  sample  shown  in  Figure  3  was  produced  in  upper  case  only  using  a  1 20  character  position  line  and 
allowing  a  two  page  length  of  continuous  stationery  (22  inches  or  556  mm)  per  page  of  text,  printing  a  maximum  of 


91 


1 08  lines  ( 1 8  inches  or  457  mm).  This  process  requires  careful  setting  up  of  the  paper  on  the  computer  printer  to  ensure 
that  the  transverse  perforations  pass  between  two  adjacent  lines  of  print,  and  the  final  page  master  is  obtained  by  photo- 
reducing  the  computer  output  sheets  to  56%  of  their  original  size.  Tire  sample  page  shown  in  Figure  29  has  been  photo- 
reduced  to  70%  of  the  original  size  on  a  fixed  ratio  offset-litho  plate  making  camera  device.  It  is  necessary  to  make  care¬ 
ful  calculations  when  defining  the  original  line  and  page  lengths  for  examples  of  this  kind  before  laying  down  the  print 
program  parameters,  and  it  is  advisable  to  show  samples  of  the  photo-reduced  output  to  members  of  the  staff  of  the 
Centre  and  to  invite  their  comments  on  the  acceptability  of  the  print.  The  legibility  of  the  text  shown  in  Figure  29  might 
be  improved,  for  instance,  by  printing  the  original  pages  at  six  lines  to  the  inch  instead  of  eight. 


AN  EXTRACT  FROM  THE  SUBJECT  INDEX 


1  CLOUd'cOMPOSIIION  DETERMIHAT tOH  BY  SATELLITE  SENSING  US1HG  THE  NIMBUS  VI 
HIGH  RESOLUTION  INFRARED  SOUNDER  >lor 

ERROR  ANALYSES  OF  OPERATIONAL  SATELLITE  SOUNDINGS  OF  VERTICAL  TEMPERATURE 
PROFILES  <12  1  -30  9  Iff??) 

TECHNOLOGY  RE 8U  !RE RENT S *F OR  ADVANCED  EARTH  ORBITAL  TRANSPORTATION  SYSTEMS 
VOL  3:  SUMMARY  REPORT  -  DUAL  MODE  PROPULSION 

AI"°cIhERALIZEDCC0NPUTER  PROGRAM  FOR  PR 1M 1 T t V£ -EQUAT 1  OH  MODELS  <1.10.1976- 

0c1aN5THERNAL  RESPONSE  TO  STRONG  ATMOSPHERIC  FORCING.  l!  CHARACTERISTICS 
OF  FORCING  EVENTS!  2:  THE  ROLE  OF  GHE-Dt  KEhSl OHAL  PROCESSES 
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T78-6 275 

T78-6164 
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AN  EXTRACT  FROM  THE  AUTHOR  INDEX 


A  i  rscouqh ,  J.  B. 

EURGPIA  CERAMICS  FOR  USE  AS  FAST  REACTOR  NEUTRON  ABSORPERS 
Alton. T. 

INTECRATING  PRODUCT  DESIGN  AND  MANUFACTURE  BY  COMPUTER 
Anable.  U. 

SMELTING  CEMENT  COPPER  IN  AH  ELECTRIC-ARC  FURNACE 
Anctrson.  J.  R. 

THEORY  OF  THE  ACOUISITION  OF  COGNITIVE  SKILLS  < 0 nnuary-Doctnber  1977) 
Andrews,  A. 

REMOTE  ION  BEAM  POSITION  MEASUREMENT  FROM  RANDOM  BEAN  MODULATION 
Armstrong.  R  0. 

OEGEfiER  ACY  IN  SPECIAL  PURPOSE  PRIMAL  ALGORITHMS  USED  IN  OBTAINING  LEAST 
ABSOLUTE  VALUE  ESTIMATORS 


T78-6110 
T78  -6244 
T78-5944 
T78-5746 
T78-6006 
T78-5766 


Fig.29  Sample  indexes  from  R  &  D  abstracts  -  TRC  1 979 


4.18.4  Contents  pages  (see  Figure  18)  can  be  produced  automatically  once  the  main  text  of  an  incremental  issue  of  the 
announcement  journal  has  been  printed.  The  print  program  can  incorporate  logic  for  storing  a  list  of  the  main  headings 
and  sub-headings  used  in  a  particular  issue,  along  with  the  numbers  of  the  pages  upon  which  they  occur.  The  data  are 
then  formatted  and  printed  out  to  provide  the  reader  with  an  easy  index  to  the  contents  of  the  journal.  These  contents 
pages  arc  placed  with  the  standard  introductory  material  which  guides  the  reader  in  his  use  of  the  publication  and  advises 
him  on  such  matters  as  the  method  of  obtaining  the  actual  reports  announced  in  the  journal.  They  arc  inserted  before 
the  main  text  and  must  therefore  have  a  separate  page-numbering  system  such  as  i,  ii,  iii  etc. 

4.15.5  Print  quality  depends  very  much  upon  the  choice  of  printing  process  and,  for  orthodox  on-line  printing,  the 
choice  of  computer  printer.  These  decisions  depend,  in  turn,  upon  the  volume  of  work  and  the  resources  available,  but 
the  output  from  the  most  modestly  pticed  printing  device  can  be  optimised  by  using  opaque  plain  white  paper  and  by 
carrying  out  each  important  printing  run  with  a  well  cleaned  machine  and  a  new  carbon  ribbon.  Printing  work  of  lesser 
significance  can  be  scheduled  to  fill  in  the  time  until  the  printer  is  required  for  the  next  good-quality  run.  There  is  no 
need,  for  instance,  to  use  a  siow  printer  and  take  special  precautions  when  carrying  out  SDI  runs,  although  upper  and 
lower  case  printing  is  highly  desirabh  especially  if  abstracts  are  provided,  and  plain  paper  is  generally  preferred. 

4.19  Production  Scheduling 

It  is  quite  impracticable  to  lay  down  a  universal  scale  for  the  measurement  of  throughput  in  computer-based  ST1 
systems,  so  that  any  performance  figures  which  are  quoted  by  working  organisations  czn  only  be  regarded  as  a  rough 
guide.  The  basic  production  parameters  will  reflect  the  quantity  of  documents  passing  through  the  system  in  a  year  and 
the  frequency  of  issues  of  the  journal  in  which  they  are  announced.  Dividing  the  quantity  of  documents  by  the  number 
of  issues  will  provide  an  indication  of  the  number  of  items  of  bibliographic  information  to  he  announced  in  each 
incremental  issue  which,  when  multiplied  by  the  average  number  of  characters  per  item,  gives  the  basis  for  assessing  the 
rate  at  which  the  data  prep  facility  can  be  expected  to  work.  When  making  these  calculations,  allowances  must  be  made 
for  such  factors  as  irregularity  in  the  rate  of  document  flow  and  the  inevitable  computer  down  time  and,  because  the 
reputation  of  an  STI  organisation  will  be  based  upon  the  dependability  of  the  services  which  it  provides,  these  allowances 
should  be  made  as  generous  as  possible.  It  can  be  assumed  that  a  VDU  keyboard  operator  will  provide  between  five  and 
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six  hours  of  useful  input  in  a  working  day  aiul,  in  that  time,  should  produce  30,000  corrected  key  depressions,  i.e.  about 
5,000  key  depress;  oe.  hour.  This  figure  takes  account  of  an  assumption  that  approximately  one  third  o.  the 
operator’s  time  is  spent  in  reading  back  text  o  hiv-h  has  already  been  keyed  m.  Thus,  in  systems  where  the  initial  proof¬ 
reading  and  correction  of  data  are  carried  out  outside  the  data  prep  room,  a  higher  rate  of  key  depressions  can  be 
expected. 

Ty  pical  production  cycles  are  based  upon  monthly ,  semi  monthly  or  fortnightly  incremental  inters als,  and  each 
issue  of  the  announcement  journal  should  be  ready  for  distribution  two  or  three  working  days  before  the  date  printed  on 
the  cover.  In  drawing  up  the  production  schedules  it  is  essential  to  ensure  that  the  pnnt  shop  which  converts  the  camera- 
ready  copy  created  by  the  comparer  into  bu.md  books  has  adequate  capacity ,  and  that  the  data  base  handling  capability 
of  the  organisation  for  the  production  of  regular  Shi  output  is  sufficient  to  meet  the  users’  needs.  The  journal  editor, 
the  computer  operations  manager  and  the  data  prep  supervisor  should  consult  togethei  to  draw  up  the  schedules,  and  it 
is  "'morally  incumbent  upon  the  eddor  to  monitor  progress  and  to  attempt  to  clear  hold-ups  in  the  work  flow  as  they 
arise. 


4.20  File  Security 

There  is  no  need  to  retain  incremental  disc  files  of  bibliographic  data  for  long  periods  once  an  issue  of  the  announce¬ 
ment  journal  has  been  published  and  the  data  base  has  been  updated.  It  is  sensible  to  retain  the  journal  page  masters  for 
up  to  two  years  in  case  a  reprint  of  any  issue  should  be  called  for  in  that  time,  and  a  similar  retention  period  for 
incremental  magnetic  tapes  could  provide  a  valuable  back  up  service  to  the  data  base  computer.  Three  ‘generations'  of 
the  data  base  will  be  required  on  magnetic  tape,  with  each  generation  updated  as  a  new  increment  is  added  to  the  direct 
access  disc  file  on  the  data  base  computer. 

Cure  should  be  taken  to  protect  new  data  as  they  are  being  entered  from  the  YDU  terminals.  Systems  fot  handling 
text  is  this  way  generally  incorporate  facilities  for  accessing  a  common  file  from  a  number  of  terminals  at  the  same  time, 
a  practice  which  implies  considerable  risk  of  file  corruption.  It  is  therefore  advisable  to  make  a  back-up  copy  of  each 
input  file  at  least  tw  ice  a  day  and  to  update  the  incremental  file  on  disc  at  the  close  of  each  day 's  operations,  making  a 
copy  of  the  file  on  magnetic  tape  at  the  same  time.  At  least  three  geneiations  of  the  incremental  magnetic  tape  should  be 
maintained.  When  an  issue  of  the  announcement  journal  is  ready  for  printing  it  is  usual  to  set  up  a  disc  file  containing  the 
indexes  w  hic.li  are  to  be  printed  w  ith  the  journal  and,  where  accumulated  indexes  are  being  created,  arrangements  w  ill 
have  to  be  made  to  update  the  related  accumulated  magnetic  tape  files.  Back  up  copies  of  the  per  issue'  index  files 
should  be  held  on  magnetic  tape  with  the  text  files,  and  three  generations  of  the  accumulated  index  files  should  be 
maintained. 

4.21  The  Handling  of  Security  Classified  Information 

The  .  .cunly  of  files  described  in  paragraph  4.20  refers  to  essential  precautions  against  the  loss  of  vital  data  due  to 
file  corruption  which  may  be  caused  by  mechanical  breakdown  or  human  fallibility .  Aspects  of  the  handling  of  data 
which  carry  a  security  classification  and  must  therefore  be  protected  against  unauthorised  access  are  dealt  with  m 
Section  1 0,  Yol.IY  of  the  Manual,  and  the  impact  of  such  data  upon  a  computerised  sy  stem  w  ill  depend  upon  the  degree 
of  secrecy  involved.  Bibliographic  details  of  documents  of  the  highest  orders  of  confidentiality  might  never  be  stored  in 
a  computer,  whereas  less  sensitive  material  might  be  handled  only  by  nominated  members  of  staff  and  held  on  separate 
files  with  back-up  copies  stored  in  a  special  limited  access  safe.  Wherever  possible  a  special  version  of  the  computer 
operatingsystem  should  be  kept  fot  use  with  security  classified  information,  and  precautions  must  be  taken  to  ensure 
that  all  data  are  erased  from  core  store  and  the  internal  work  areas  at  the  end  of  each  security  run. 

4.22  Housekeeping  and  Stock  Control 

The  main  work  of  an  STI  Centre  is  the  recorvli.ig  of  complete  report  references  for  information  processing  and 
retrieval  purposes  in  large  bibliographic  files,  but  this  can  be  greatly  assisted  by  the  recording  of  a  minimal  description  of 
cadi  accession  on  internal  files  for  housekeeping  and  stock  control  purposes.  A  ty  pic.il  stock  control  system  can  be  based 
upon  three  files,  a  Reports  File,  a  Transactions  File  and  a  Customer  File,  and  sample  records  are  shown  m  Figures  30, 3 1 
and  32. 

Reports  reaching  the  C  -litre  can  comprise  new  material  oi  duplicate  copies  of  reports  already  m  the  system  or 
returned  loans,  and  it  is  strongly  argued  that  the  most  error  proof  way  of  operating  the  initial  recording  phase  of  the 
stock  control  system  is  to  maintain  a  file  of  handwritten  document  Movement  Cards  against  which  all  incoming  reports 
can  be  immediately  checked.  By  keying  a  minimal  record  defining  each  new  accession  into  a  Reports  File  from  a  terminal 
at  the  point  at  which  reports  are  received  however,  it  should  be  possible  to  meet  the  requirements  of  this  initial  recording 
operation.  It  would  then  be  necessary  to  provide  software  with  rapid  information  retrieval  facilities  operating  in  an 
interactive  mode.  The  three  basic  stock  control  files  are  linked  so  that  information  common  to  more  than  one  of  them 
need  only  be  keyed  in  once  and,  sime  requests  for  document,  are  recorded  m  the  Transactions  File,  tile  „y stem  can  be 
readily  used  for  invoicing  and  accounting  purposes  where  reports  are  sold,  and  fot  the  printing  of  reminders  for  the 
control  of  overdue  loans.  Depending  upon  the  rate  at  which  documents  flow  tluough  the  system ,  several  days  can  elapse 
between  the  initial  recording  of  a  new  accession  and  the  completion  of  the  preparations  for  its  appearance  m  the 


Field 

Number 

Contents 

Length 
(  Characters) 

1 

Accession  number 

8 

2 

Hardcopy  or  Microfiche 

2 

3 

Title 

24 

4 

Author 

15 

5 

Pagination 

3 

6 

Stock  balance 

4 

7 

Stock  sent  out  (Stock  movements) 

4 

8 

Originator 

21 

9 

Availability 

9 

10 

First  linking  address 

2 

11 

Second  linking  address 

2 

Notes: 

1.  Economies  in  file  size  can  be  achieved  by  placing  binary  values  in  the 
numeric  fields. 

2.  Fields  10  and  1 1  are  used  to  link  the  Reports  File  with  the  Customer 
and  Transactions  Files. 

Fig.30  A  typical  Reports  File  record 


Field  Length 

Number  Contents  (Characters) 

1  Account  number  5 

2  Name  and  address  118 

3  Total  number  of  requests  received  4 

4  First  linking  address  2 

5  Second  linking  address  2 


Fig.3 1  A  typical  Customer  File  record 


Field  Length 

Number  Contents  (Characters) 

1  Request  number  5 

2  Customer  account  number  5 

3  Accession  number  8 

4  Quantity  ordered  4 

5  Date  issued  6 

6  Date  due  back  (loan)  6 

7  Date  of  request  6 

8  First  linking  address  2 

9  Second  linking  address  2 


Notes: 

1 .  Economies  in  file  size  can  be  achieved  by  placing  binary  values  in  the 
numeric  fields. 

2.  The  linking  address  fields  ;>re  used  to  link  the  three  Stock  Control  files 
together. 


Fig.32  A  typical  Transactions  File  record 
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announcement  journal.  The  amount  of  keying  required  to  identity  each  document  for  stock  control  purposes  is  small, 
and  it  generally  simplifies  the  system  and  reduces  the  risk  of  errors  if  a  fresh  start  is  made  when  the  full  particulars  of 
each  accession  are  ready  for  keying  into  the  main  bibliographic  files. 

The  Reports  Hie  entries  createu  at  the  in.tial  recording  stage  will  contain  the  basic  information  required  for  the 
Document  Processing  Sheet  (Tig.  16)  which  travels  with  each  document  to  control  the  complete  recording  operation. 
Where  operational  conditions  permit,  it  is  useful  to  eliminate  the  risk  of  transcription  errors  by  holding  the  stock  of  blank 
processing  sheets  on  continuous  stationery  and  then  printing  out  the  initial  identification  data  for  each  document  via  the 
computer.  It  is  then  necessary  to  match  up  each  partially  completed  processing  sheet  with  its  parent  document  before 
the  recording  procedure,  including  the  analysis  of  the  report  for  indexing  purposes,  can  be  completed. 

4.23  Forms  Design 

Many  of  the  forms  associated  with  the  control  ot  computer-based  systems  will  consist  of  simple  headed  lists  printed 
entirely  by  the  machine  and  providing  statistical  Jata  concerning  the  throughput  of  documents,  etc.  Torms  which  display 
textual  data  deii.ed  from  the  computer  but  intended  for  use  outside  the  computer  area,  such  as  the  Document  Processing 
Sheets,  invoice  forms  and  index  cards  are  greatly  improved  by  being  pre  printed  on  continuous  Stationery  of  a  suitable 
size,  and  there  are  certain  basic  rules  which  have  to  be  observed  when  such  forms  are  being  designed. 

It  is  general^  Accessary  to  allocate  space  for  each  data  field  on  the  basis  of  ten  characters  to  the  inch  across  the  term 
and  six  lines  to  the  inch  along  its  length.  Many  kinds  of  computer  printer  require  a  margin  between  the  edge  of  the  paper 
and  the  first  print  position  in  any  line,  ana  precise  positioning  of  the  text,  which  is  essential  if  the  completed  form  is  to 
be  acceptable  to  the  users,  can  only  be  achieved  if  the  computer  operator  sets  up  the  paper  accurately  in  relation  to 
datum  lines  marked  on  the  bed  of  the  printer.  If  required,  the  edge  strips  containing  the  sprocket  holes  can  be  removed 
by  guillotine  or  by  means  of  perforations  inserted  when  the  blank  forms  are  being  prc-pnnted.  The  use  of  coloured  paper 
car.  be  beneficial,  for  testa  have  shown  that  black  pnnt  on  yellow  paper  is  more  legible  than  black  on  white.  The  use  of 
two  colours  of  ink  on  any  form  is  not  rccommeded  since  it  is  of  limited  value  and  can  add  20rf  to  the  cost  of  the  pre¬ 
printed  stationery. 

Where  up  io  six  or  eight  copies  are  required  of  any  pages  of  computer  output  it  is  generally  most  economical  to  use 
multi  part  paper  with  carbon  inserts,  although  the  rather  expensive  'no  carbon  requued’  papers  are  more  convenient  to 
handle.  In  the  case  of  multi  part  pre-pnnted  stationery ,  it  is  often  advisable  to  use  different  colours  for  the  various  parts 
to  simplify  sorting  when  the  sheets  are  separated  or  ‘burst'.  Three  part  invoicing  forms  represent  a  good  example  of  this 
technique  if  the  various  parts  arc  made  instantly  recognisable  by  having  the  in.oice  itself  printed  on  white  paper,  the 
despatch  copy  on,  say,  yellow  and  the  accounting  copy  on  pink  or  some  other  contrasting  colour. 

4.24  Computer  Configurations 

The  GIX  1060  computer  used  for  the  processing  of  research  report  literature  at  the  Defence  Research  Information 
Centre  of  the  UR  MOD  is  illustrated  in  Tigure  33.  This  configuration  is  described  fully  in  the  paper  by  Hart  ..nd  included 
in  the  Conference  Proceedings  referenced  as  Item  12  in  the  Bibliography. 

The  Technology  Reports  Centre  of  the  UR  Department  of  Industry  has  a  Molecular  18  .omputer  of  similar  size, 
manufactured  by  Business  Computers  Limited.  The  TRC  machine  has  the  following  configuration. 

Central  processing  unit  (96k  bytes) 

Operating  console 
1  32  megabyte  disc  unit 
1  6.4  megabyte  disc  unit 
1  Magnetic  tape  unit 

6  Visual  disp'ay  key  stations  (VDU  terminals)  whit  1 280  character  screens 
1  Daisy-wheel  keyboard  printer  (45  char/sec) 

1  200  1  pm  printer  (upper  and  lower  case) 

1  75  char/sec  printer 
1  paper-tape  reader  (500  char/  ec) 

1  paper-tape  punch  (40  char/sec) 

5.  THE  USE  OF  MINI-COMPUTERS  IN  THE  PROCESSING  OF  RESEARCH  REPORT  LITERATURE 

The  development  of  modem  computer  technology  tends  towards  extreme  miniaturization  with  ever  increasing  date 
storage  capacities  occupying  an  ever-decreasing  space,  so  that  it  becomes  difficult  to  differentiate  between  the  mini 
computer  and  the  mam  frame  machine  of  moderate  capacity .  Hie  optimum  use  of  large  dal  t  bases  of  the  kind  described 
in  Uiapter  three  can  require  up  to  ten  100  megabyte  disc  units  on  line  at  any  one  time  and,  if  access  is  provided  fo. 
remote  terminals  over  telephone  land  lines  or  by  satellite,  the  computer  installation  must  also  be  equipped  with  special 
communications  control  equipment.  The  role  of  the  mini  computer  in  installations  of  this  size  becomes  that  of  the 
intelligent  terminal  fulfilling  data  capture  and  related  word  processing  functions  as  well  as  providing  off  line  printing  and 
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communications  contro'.  facilities.  The  communications  control  aspects  of  STI  .vorh  are  normally  defined  by  the 
manufacturers  of  the  m«.‘,i  frame  machine  and  the  National  Post  Office  authorities  and  form  a  highly  specialised  part 
of  the  total  system.  The  use  „T minicomputers  for  word  processing  and  as  off-line  print  stations  involves  many  of  the 
techniques  described  in  Chapter  four  and,  if  it  is  accepted  that  the  term  ‘mini  computer'  defines  machines  with  core 
capacities  up  to  16k  (16,000)  bytes,  the  effect  which  this  limitation  will  have  upon  file  and  program  sues  must  be 
considered  when  the  total  system  is  being  designed. 

5.1  Program  Aspects 

A  minicomputer  equipped  with  VDU  terminals  and  a  simple  printer  cc.)  be  used  very  effectively  for  on-line  program 
development,  with  the  progiams  being  written  onto  floppy  discs.  The  compilation  of  such  programs,  however,  normally 
requires  access  to  a  much  larger  machine,  and  the  sophisticated  software  offered  by  some  manufacturers  as  an  aid  to 
error  tracing  and  the  correction  of  programs  (debugging)  can  rarely  be  accommodated  in  less  than  32  k  bytes  of  core  if 
response  times  are  to  be  acceptable  to  the  programmers. 

Mini  computers  set  up  for  word  processing,  i.c.  data  capture  and  the  printing  ot  good-quality  output,  can  be  enhanced 
to  perform  arithmetic,  e  g.  accounting,  by  the  addition  of  suitable  logic  boards  which,  with  the  introduction  of  moucrn 
technology ,  do  not  require  enlarged  cabinets.  There  is,  however,  an  inevitable  constraint  upon  the  number  of  programs 
which  can  be  run  simultaneously,  and  any  over  loading  of  the  system  becomes  particularly  noticeable  in  the  slowing- 
down  of  interactive  operations,  i.e.  those  activities  which  are  based  upon  a  dialogue  between  the  usu  and  the  computer. 

5.2  Non-biblicgrnphic  Applications 

Reference  has  already  been  made  in  paragraph  4.22  to  the  value  of  stock  control  routines  within  a  comprehensive 
STI  system  and,  where  the  number  of  transactions  and  the  quantity  of  material  being  handled  justify  it.  these  non- 
bibliographic  functions  can  be  usefully  processed  on  a  dedicated  mini  computer.  Access  may  be  required  to  a  mam-frame 
machine,  possibly  the  computer  which  holds  the  data  base,  either  by  a  direct  cable  connection  through  an  interface  or  by 
the  use  of  compatible  tapes,  to  run  major  sorting  programs  and  to  allow  the  updating  of  files  which  hold  data  common 
to  both  the  stock  control  and  main  text  handling  systems.  Once  a  stock  control  system  has  been  set  up  m  this  way  it 
can  readily  be  extended  to  .over  accounting  procedures  and  various  other  housekeeping  jobs.  The  mini-computer  should 
be  equipped  with  its  own  printer  since  it  is  a  feature  of  some  stock  control  functions  that  they  require  a  dedicated  printer 
for  long  periods  although  the  total  quantity  of  printing  in  the  working  day  may  be  quite  small.  An  example  of  this  type 
of  application  is  found  in  the  initial  recording  of  new  accessions  referred  to  in  paragraph  4.22. 


6.  COMPUTER  15UREAU  OPERATIONS 

There  are  many  functions  within  a  comprehensive  STI  system  which  require  large  computer  resources.  The 
accumulation  of  a  single  data  base  can  result  in  the  addition  of  1 00  megaby  tes  of  on  lino  storage  capacity  cadi  y  car ,  the 
large  sorting  pre  jrnins  can  only  work  efficiently  if  more  than  32k  (32,000)  bytes  of  main  core  arc  exclusively  available 
while  they  are  being  run,  and  on  line  information  retrieval  from  ten  remote  terminals  operating  simultaneously  can 
increase  the  demand  for  core  capacity  by  up  to  eight  times  this  figure,  depending  upon  the  efficiency  of  the  software 
and  the  need  for  maintaining  rapid  response  times  between  the  computer  and  the  terminals.  Just  printing  out  the  results 
of  searches  in  such  a  system  could  create  a  daily  demand  for  over  three  hours  exclusive  use  of  a  printer  working  at  the 
rate  of  400  lines  per  minute  in  upper  and  lower  case  and,  although  some  components  of  computer  hardware  tend  to 
become  relatively  less  expensive  as  the  technology  develops,  it  is  generally  evident  that  installations  which  are  large 
enough  to  cope  with  the  demands  of  this  kind  ol  system  can  only  operate  economically  .f  their  resources  arc  being  fully 
exploited  for  the  majority  of  a  24  hour  day.  The  nature  of  the  STI  task  however,  creates  peaks  ot  utilisation  during  the 
normal  working  day  with  very  light  workdoads  at  other  times. 

One  way  out  of  this  dilemma  is  to  set  up  operations  on  the  basis  of  distubutcd  processing  with  the  aid  of  a  suitable 
computer  bureau.  This  implies  holding  major  files  on  a  large  main  frame  machine  linked,  preferably  by  means  of  a 
dedicated  telephone  line,  to  terminals  at  the  STI  Centre.  The  bureau  cannot  be  expected  to  exercise  the  kind  of  close 
control  of  files  required  fot  the  systematic  capture  of  reliable  and  accurate  bibliographic  data,  neither  would  a  general 
purpose  computer  be  normally  equipped  to  cope  witli  the  special  requirements  of  high-quality  printing.  Housekeeping 
and  stock  control  activities  such  as  those  outlined  in  paragraph  4.22  might  also  prove  impracticable  without  access  to 
relatively  simple  local  data  storage  and  printing  facilities,  since  parts  of  these  files  could  be  required  for  on  line  access 
throughout  the  working  day  and  a  bureau  may  net  be  able  to  offer  such  facilities  at  an  economic  cost. 

6.1  Setting  Up  the  Uurcau  Operation 

Two  types  of  computer  bureau  can  be  recommended  for  use  by  an  STI  Centre.  The  usual  kind  is  (lie  purely 
commercial  bureau  which  can  only  be  expected  to  offer  special  facilities  at  a  special  rate  of  payment  and  may  thorotorc 
prove  to  be  very  costly  in  the  handling  of  largo  bibliographic  files.  The  preferred  ty  pc  is  the  large  scale  machine  operated 
in  a  bureau  mode  by  an  educational  or  central  government  establishment,  and  organisations  of  tins  kind  have  been  known 
to  take  a  constructive  interest  in  the  special  implications  of  STI  processing  and  to  help  in  overcoming  the  inevitable 
problems  associated  with  large-scale  text  handling  in  a  bureau  environment, 
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Liaison  with  the  bureau  requires  the  attention  of  an  expert  in  ADI’  on  behalf  of  the  Centre  and,  according  to  the 
scale  of  the  operation,  involves  regular  meetings  with  the  bureau  management  particularly  during  the  early  stages  of  the 
operation.  Users  are  expected  to  familiarise  themselves  with  the  standard  procedures  adopted  by  the  bureau  and  to 
prepare  batch  work  in  accordance  with  any  special  rules  which  have  been  laid  down.  The  conditions  under  which  regular 
runs  are  earned  out  have  to  be  agreed,  with  particular  emphasis  given  to  those  jobs  which  are  time  critical.  It  is  often 
helpful  for  users  to  gain  some  understanding  of  the  machine  itself  and  its  operating  system  in  addition  to  a  complete 
knowledge  of  the  most  efficient  programming  procedures. 

An  STI  Centre  which  uses  a  bureau  to  handle  its  main  files  will  select  local  word  processing  terminals  for  data 
preparation  and  peripheral  devices  for  special  printing  etc.  on  the  basis  of  obtaining  the  best  value  for  the  availaole  money 
It  is  likely  then  that  there  will  be  problems  of  compatibility  with  the  mam  frame  machine  which  will  generally  provide 
its  best  response  only  if  linked  to  terminals  produced  by  specified  manufacturers.  These  problems  can  be  overcome  by 
introducing  software  or  hardware  interferes  which  will  act  as  simulators,  so  that  the  bureau  machine  ‘sees’  the  users’ 
terminals,  whatever  their  type,  as  devices  which  come  within  the  recommended  range  of  equipments. 

6.2  The  Advantages  of  Bureau  Operation 

The  principal  advantages  of  uureau  operation  have  already  been  indicated,  but  it  is  also  important  that  such  a  system 
should  be  shown  to  be  cost  effective  over  a  period  of  seven  to  ten  years.  As  well  as  offering  the  advantages  of  size,  staff 
resources  and  a  wide  range  of  facilities,  it  is  likely  that  the  bureau  will  provide  practical  help  and  advice  through  some 
form  of  user  support  organisation.  Thus  progress  towards  a  reliable  working  system  can  be  achieved  quite  rapidly  The 
bureau  operation  will  also  provide  the  staff  of  the  Centre  with  practical  experience  which  could  prove  invaluable  if  it  is 
decided,  at  a  later  date,  to  invest  in  a  dedicated  main-frame  machine. 

6.3  The  Disadvantages  of  Burea  Operation 

The  disadvantages  of  bureau  operation  are  generally  related  to  the  fact  that  the  STI  Centre  will  inevitably  relinquish 
absolute  control  over  various  important  computer  activities.  This  can  be  especially  significant  during  the  development 
period  when  the  test  runsof  new  programs  are  being  submitted,  since  the  time  necessary  for  getting  work  into  the 
computer  and  receiving  results  in  the  form  of  print-out  can  add  days  to  the  time  required  by  the  machine  itself.  It  can 
be  particularly  frustrating  when  a  job  fails  because  of  some  simple  error  which  would  have  been  easily  put  right  at  run 
time  on  an  in-house  machine. 

Operator  errors  at  the  bureau  can  also  create  problems,  particularly  if  the  printer  is  incorrectly  set  up,  or  the  output 
is  printed  on  the  wrong  paper  or,  upon  completion,  is  sent  to  the  wrong  customer.  It  has  been  known  for  jobs  to  fail 
because  the  constituent  programs  have  been  run  in  the  wrong  sequence,  and  problems  can  also  arise  if  output  in  forma 
tion  is  assigned  to  the  wrong  disc  or  magnetic  tape  file,  thus  corrupting  essential  data. 

Despite  these  possible  disadvantages,  there  is  much  to  commend  the  use  of  a  bureau  to  support  computer  based  STI 
operations.  Many  of  the  problems  which  have  been  described  can  be  mitigated  if  user  groups  or  similar  liaison  bodies  arc 
rst  up  to  disseminate  information  about  system  changes  etc,  possibly  by  the  issue  of  news  sheets.  The  local  mini 
jinputer  linked  to  a  remote  main  frame  machine  can,  in  the  long  term,  form  part  of  a  communications  network  which 
might  be  developed  to  bring  a  number  of  STI  Centres  together  for  the  mutually  beneficial  sharing  of  information 
resources. 
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ADP 

Batch  processing 

Bit 

Block 

Boolean  algebra 

Buffer 

Byte 

Character 

Character  set 
Compile 

Continuous  stationery 

Control  character 
COSATI 
Data  capture 

Down  time 
Edit 

Field 

File 

Flag 

Floppv-disc 
or  flexible  disc 
or  diskette 

Hard-wired  logic 

Interface 

Interpreter 
Main  frame 

NTIS 

Off-line 


Automatic  Data  Processing 

A  mode  of  computer  operation  in  which  the  data  inputs  for  a  number  of  tasks  requiring 
the  same  program  are  accumulated  over  a  period  of  time  before  being  processed  in  a 
single  machine  run. 

The  smallest  unit  of  information  in  a  computer:  a  unit  of  storage  capacity. 

A  group  of  records,  words,  characters  or  digits  handled  as  a  unit  in  a  computer  system. 

A  binary  system  of  algebra  originally  formulated  by  G.S.Boole  and  based  on  the  operators 
AND,  OR,  NOT. 

Storage  used  to  accommodate  a  difference  in  the  data  handling  Tate  of  two  devices  when 
information  is  being  transmitted  from  one  to  the  other. 

A  set  of  adjacent  bits  operated  upon  as  a  unit  in  a  computer.  A  byte  is  generally  smaller 
than  a  word  and  typically  eight  bits  or  six  bits  in  length. 

A  digit,  letter  or  other  symbol  used  in  the  representation  of  information.  In  computers  a 
character  is  represented  by  a  pattern  of  adjacent  bits. 

A  finite  set  of  different,  specified  characters. 

To  translate  a  source  program  into  a  program  which  the  computer  can  execute. 

A  continuous  strip  of  paper  in  roll  or  fanfold  form,  usually  with  uniformly  spaced  holes 
near  the  edges  for  sprocket  drive  purposes  and  a  transverse  line  of  perforations  betsveen 
sheets.  Used  as  an  output  medium  on  line  printers. 

A  character  whose  function  is  to  initiate  a  computer  operation. 

Committee  on  Scientific  and  Technical  Information  (see  Section  1  of  the  Manual). 

Tire  acquisition,  in  machine  readable  form,  of  data  to  be  processed  by  a  computer  (also 
data  preparation  or  data  prep) 

The  time  during  which  a  computer  is  not  available  for  operation  due  to  malfunctioning. 

To  prepare  data  for  subsequent  processing  or  output  by  modifying  the  content,  form  or 
format. 

A  specified  part  of  a  record  reserved  for  a  particular  category  of  information. 

A  set  of  related  records  treated  as  a  unit. 

Any  of  various  types  of  indicators  used  for  identification:  or  a  character  that  signals  the 
occurrence  of  some  condition  such  as  the  end  of  the  word. 

A  low-cost,  compact  magnetic  storage  device  comprising  a  small  disc  of  flexible  material. 
The  information  is  stored  on  circular  tracks  in  the  conventional  way,  but  each  disc  is 
contained  in  a  sealed  plastic  envelope  which  has  a  slit  from  the  centre  to  the  outer  edge. 
This  slit  provides  access  for  the  read/write  head. 

A  term  used  to  describe  the  control  of  functions,  within  the  computer  or  its  peripheral 
equipment,  by  means  of  readily  exchangeable  printed  circuit  boards,  generally  as  an 
alternative  to  the  control  of  functions  by  program  logic.  The  use  of  hard-wired  logic  is 
increasing  with  the  development  of  microprocessor  technology. 

An  area  common  to  two  systems  or  in  which  two  systems  interact  with  each  other, 
e.g.  the  hardware  linking  a  computer’s  central  processing  unit  with  the  peripheral  device. 

A  device  that  prints  on  a  punched  card  the  data  already  punched  in  the  card. 

Synonymous  with  Central  Processing  Unit;  generally  implying  a  computer  of  large 
capacity. 

National  Technical  Information  Service.  Department  of  Commerce. 

Relating  to  the  use  of  peripheral  equipment,  cither  uncontrolled  by  the  cent'  .1  processing 
unit  or  initiated  by  the  unit  at  a  time  subsequent  to  the  current  operation. 
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Parameter 
Recoal 
Ron  time 
SDI 
Word 
Work  area 


A  variable  that  is  given  a  constant  value  for  a  specific  purpose  or  process. 

A  group  of  related  items  of  information  treated  as  a  unit. 

The  time  during  which  a  computer  program  or  routine  is  being  executed. 

The  Selective  Dissemination  of  Information. 

A  sot  of  adjacent  bits  or  characters  stored  or  operated  upon  as  a  unit  in  a  computer. 
A  storage  location  reserved  for  temporary  results. 


Section  6 


ANNOUNCEMENT  SERVICES  AND  PUBLICATIONS 
by 

Elizabeth  Il.kidlcr 

Defence  Research  Information  Centre 
Ministry  of  Defence 
Orpington,  Kent,  UK 


ABSTRACT 


The  methods  by  which  an  Information  Centre  announces  its  holdings  and  recent 
acquisitions  to  its  users  are  surveyed.  Examples  arc  given  of  manually  and  computer 
produced  bulletins  from  a  number  of  Information  Centres  and  Libraries,  and  production  of 
indexes  to  computer  produced  bulletins  is  discussed.  Manual  and  computer-based  SD! 
services  are  described  with  notes  on  profile  construction,  use  of  commercial  magnetic  tapes 
to  extend  an  in-house  SDI  sendee,  and  standard  profiles.  Other  services  which  involve 
repackaging  the  literature  resources  of  a  Centre  are  reviewed,  including  bibliographies, 
state  of  the  art  reports,  and  packaged  information  to  assist  industry  with  technical 
innovation. 
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1.  INTRODUCTION 

1.1  Tins  section  of  the  Manual  deals  with  the  types  of  announcement  services  and  publications  which  are  produced  by 
an  Information  Centre  as  some  of  its  rnu^or  tasks.  However  extensive  are  the  holdings  of  the  Centre  and  valuable  its 
acquisitions,  they  are  of  no  real  benefit  until  the  Centre’s  users  are  made  aware  of  their  existence. 

1 .2  Information  .mpiics  communication,  a  transfer  of  knowledge  from  a  source  to  a  receiver.  Modern  developments 
in  using  electronics  for  communications  have  not  yet  superseded  the  long-established  medium  of  print  on  paper,  and 
this  section  wil)  deal  mainly  with  the  use  of  print  on  paper  for  communicating  technical  information  in  the  form  of 
regular  literature  and  accessions  bulletins,  selective,  dissemination  of  information  (SD1)  services,  special  bibliographies, 
reviews,  and  other  methods  of  packaging  information.  The  information  which  is  to  be  communicated  will  be  mainly  that 
derived  by  the  information  scientists  of  the  Cent.e  from  original  work  already  commited  to  paper  by  scientists  and 
engineers  in  the  form  of  books,  technical  reports,  and  papers  given  at  conferences  or  published  in  learned  and  technical 
journals,  the  so-called  ‘primary  publications'. 

1 .3  The  introduction  ot  computer  processing  in  Information  Centres  has  necessitated  a  degree  of  standardisation,  but 
this  does  not  mean  that  tnere  is  no  room  for  individual  styles  in  the  publications  of  a  Centre.  As  Kipling1  has  said  -  ~ 

“There  are  nine  and  sixty  ways  of  constructing  tribal  lays  and  every  single  one  of  them  is  right.’ 

Guidelines  will  therefore  be  given  on  methods  for  the  production  of  the  types  of  publication  a  Centre  can  provide  with 
examples  from  services  provided  by  national  and  other  large  information  Centres  to  illustrate  their  different  approaches 
to  the  problems.  The  examples  will  include  some  commercial  services  which  provide  information  packages  which  can 
supplement  an  in-house  service  or  provide  models  for  the  development  of  new  services  by  the  Centre. 


2.  BULLETINS 

2.1  Literature  Bulletins 

information  Centres  which  serve  their  users  with  published  information  will  prepare  current  awareness  bulletins  of 
articles  from  journals  received  in  the  Centre.  In  its  simplest  form  the  bulletin  can  be  produced  purely  by  clerical  effort 
by  circulating  journal  contents  lists.  A  more  useful  bulletin  is  achieved  if  the  information  scientists  in  the  Centre  scan 
tne  journals  and  select  the  more  important  articles  to  announce.  Although  the  marked  up  journals  can  be  placed  in  the 
desired  subject  order  and  passed  to  a  typist,  the  journals  are  out  of  circulation  fo:  a  shorter  period,  and  a  more  uniform 
product  is  obtained  if  bibliographical  details  of  each  selected  article  are  transferred  to  a  work  sheet,  on  which  the 
elements  are  arranged  in  the  order  in  which  they  are  to  appear  in  the  bulletin.  A  subject  classification  code  is  added  for 
each  item,  the  sheets  arranged  m  subject  order,  subject  heading  slips  intercollaled,  and  with  the  addition  of  the  title  page 
the  pack  of  work  sheets  is  ready  for  typing  and  reproduction.  Blick  and  Magrill2  describe  such  a  system  for  pharmacolo¬ 
gical  research  scientists.  Information  scientists  scanned  journals  for  items  of  interest  to  project  teams.  Titles  of  relevant 
items  with  author,  bibliographical  reference  and  language  constituted  the  bulletin  entry.  A  weekly  bulletin  containing 
500-1000  references  was  circulated,  saving  each  scientist  one  to  two  hours  a  week  in  journal  scanning  time  It  was 
estimated  that  the  cost  benefit  of  the  bulletin  was  equal  to  about  3.6  times  its  cost.  This  method  of  selection  can 
approach  manual  selective  dissemination  of  information  services  which  arc  discussed  more  fully  in  paragraph  3 

The  inclusion  of  abstracts  in  literature  bulletins  enhance  their  usefulness  but  also  increases  bulk  and  cost.  There  are, 
however,  so  many  commercially  available  abstracting  services  that  to  prepare  a  literature  bulletin  with  abstracts  in  an 
Information  Centre  is  justified  only  if  the  Centre  specialises  in  a  particular  discipline  or  an  interdisciplinary  subject,  as 
many  Information  Analysis  Centres  do,  or  if  the  language  used  by  the  commercial  abstract  journals  in  the  subjects 
required  is  not  the  native  language  of  the  Centre  and  the  majority  of  its  users.  The  basic  steps  for  compiling  a  literature 
bulletin  with  abstracts  will  be  the  same  as  for  a  titles  only  bulletin,  but  an  editorial  staff  will  be  required  to  maintain 
quality  control  of  the  abstracts  and  ensure  a  consistent  product.  The  cost  of  the  bulletin  cun  be  recovered  by  placing  it 
on  sale  and  two  bulletins,  not  in  the  English  language,  are  good  examples  of  this  course  of  c'.’ion.  In  France  the  Centre 
National  de  la  Recherche  Scientifique  (CNRS)  issues  the  Bulletin  Signaletique  in  50  subject  sections  under  the  main 
headings  Exact  Sciences,  Biologicil  and  Medical  Sciences,  Engineering  Sciences.  The  bulletins  are  computer-produced 
and  each  is  issued  ten  times  a  year,  each  issue  containing  subject  and  author  indexes.  (Further  information  from- 
Informascience,  Centre  de  Documentation  Scientifique  et  Technique,  Centre  National  de  la  Re'cherchc  Scientifique, 

26  Rue  Boyer,  75971  Paris  Cedcx  20,  France).  In  the  Soviet  Union  the  All-Union  Institute  for  Scientific  and  Technical 
Information  (V1N1TI)  publishes  Referativny  Zlmmal,  issued  in  several  sections  including  Astronomy  and  Geodesy, 
Biochemistry,  Biology,  Chemistry,  Electrotechnology,  Geography,  Geology,  Geophysics,  Informatics,  Machine  Construc¬ 
tion,  Mathematics,  Mechanics,  Metallurgy,  Physics.  Abstracting  services  on  such  a  scale  are  usually  computer-based,  and 
produce  computer  searchable  data  bases  as  a  by-product  of  bulletin  production.  Thus  the  PASCAL  data  base,  available 
on  the  European  Space  Agency’s  Information  Retrieval  Service  RECON  system,  is  produced  as  a  by-product  of  Bulletin 
Signaletique. 

2.2  Accessions  Bulletins 

The  accessions  bulletin  is  one  of  the  most  important  announcement  media  of  an  Information  Centre,  especially 
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where  the  accessions  relate  to  documents  of  limited  availability  such  as  technical  reports  where  only  a  limited  number  are 
printed  and  the  Centre  may  be  the  main  distribution  point  for  them.  The  function  if  such  a  bulletin  is  first  to  announce 
to  users  what  has  recently  been  received  in  the  Centre,  but  a  well-produced  and  indexed  bulletin  is  also  a  permanent 
reference  tool  for  retrospective  search.  The  function  of  the  bulletin  in  promoting  the  image  of  the  Centre  must  also  be 
beme  in  mind  when  considering  its  content  and  appearance. 

The  layout  of  the  bulletin  can  be  a  matter  of  choice  for  the  Centre,  and  some  typical  examples  of  bulletin  entries 
will  be  illustrated  in  the  following  paragraphs.  It  will  depend  to  a  iarge  extent  on  whether  the  bulletin  is  produced  by 
manual  methods  or  by  computer.  There  arc  some  geneial  points  which  should  be  remembered.  The  user  will  scan  each 
new  issue  to  inform  himself  of  new  documents  in  his  speciality.  Items  should  therefore  be  grouped  according  to  subject 
for  his  convenience.  If  the  Centre  uses  a  numerical  classification  such  as  the  Universal  Decimal  Classification  (UDC)  the 
items  are  grouped  under  broad  UDC  headings.  Centres  which  use  subject  indexing  terms  (descriptors)  e.g.  from  the 
Thesaurus  of  Engineering  and  Scientific  Terms  (TEST)5  will  probably  prefer  to  use  a  published  set  of  subject  category 
codes  such  as  those  issued  by  COSAT1  (see  this  Manual  Vol.l,  pp.14-15).  Individual  items  should  be  laid  out  so  that 
important  information  is  prominent.  Titles  should  be  printed  in  capitals  or  in  bold  type,  and  the  number  which  the 
Centre  uses  to  identify  the  item  in  its  system,  usually  the  Centre’s  accession  number,  should  be  easily  identified.  It  is 
helpful  to  users  if  introductor’  pages  are  included  in  the  bulletin  giving  information  on  how  to  obtain  the  documents 
cited,  and  a  guide  to  the  layout  of  items.  An  example  of  such  a  guide,  from  the  UK  department  of  Industry’s  Technology 
Reports  Centre’s  R  &  D  Abstracts,  is  shown  in  Figure  1 . 

If  some  of  the  items  to  be  announced  are  restricted  to  certain  users  because  of  security  or  commercial  limitations, 
special  supplements  to  the  bulletin  ora  separate  restricted  edition  should  be  compiled  and  circulated  only  to  those  users 
entitled  to  receive  the  information. 

2. 2. 1  Manually-Produced  Bulletins 

The  format  of  a  manually-produced  accessions  bulletin  will  depend  on  the  eilort  available  in  the  Centre,  and  the 
use  to  which  it  is  put.  For  immediate  current  awareness  where  speed  of  issue  ,s  all-important,  a  titles-only  bulletin  will  be 
quick  to  produce  and  will  involve  the  minimum  of  intellectual  effort.  It  can  be  simple  typing  operation  based  on 
copying  marked  fields  on  the  work  sheet  made  out  for  each  new  document,  the  slreere  Erst  being  sorted  into  the  subject 
order  required.  An  example  of  svich  a  bulletin  is  that  currently  isn-.cd  by  the  British  Library'  Lending  Division,  Boston 
Spa,  as  the  BLLD  Announcement  Bulletin.  A  typical  entry  is  shown  in  Figi’n,  \  The  -''.cries  are  arranged  in  main 
COSATI  subject  fields.  An  author  index  is  included. 

A  bulletin  will  be  more  informative  if  abstracts  are  included,  although  ptoducL’on  *viil  Wre  longer,  more  information 
scientists’  time  will  be  involved  and  the  typing  and  reprographic  load  will  be  grea'er.  As  with  literature  bulletins 
containing  abstracts,  an  editorial  staff  will  be  required  to  ensure  quality  and  consistent  er  tries.  Some  economy  can  be 
achieved  by  coordinating  bulletin  production  with  other  ’.jocesses  in  the  Centre.  A  Centre  which  dots  not  have  access 
to  a  computer  for  searching  their  records  will  probably  rely  on  type*  catalogue  cards  for  its  subject,  author  and  other 
indexes.  Bulletin  and  catalogue  card  production  can  be  coordinated  in  various  weys.  After  typing  the  cards  one  set  can 
be  arranged  in  bulletin  page  order,  and  offset  litho  plates  made  photographically  An  ^tentative  method  is  to  type  the 
bulletin  pages  first  and  to  carry  out  two  separate  print  runs,  the  first  to  prepare  the  btsl’erio  pages  and  the  second  to 
prepare  cards  ,:xtra  copies  of  each  page  can  be  run  off  on  gummed  paper,  the  abstracts  cut  out  and  stuck  on  to  cant. 
Alternatively  the  extra  pages  are  run  off  on  card  stock  which  is  subseqi.”  ntly  guillotined  to  size.  This  lattci  method  was 
used  in  the  predecessor  of  the  UK  Defence  Research  Information  Centre,  the  Ministry  cf  Aviation’s  Technical  Informa¬ 
tion  and  Library  Service  up  to  196S  (Vessey4)  These  methods  entail  the  use  of  a  unit  card  for  all  the  catalogues  used  in 
the  Centre,  although  typed  additions  to  the  cards  can  be  made  if  necessary. 

in  the  intermediate  stage  before  computerisation,  tape  typewriters,  described  in  detail  In  Section  4  of  this  Manual, 
have  been  used  to  produce  bulletins  and  cards  from  one  typing  operation.  The  cards  are  typed  first  and  tbs  bulletin  is 
subsequently  typed  automatically  using  the  punched  tape  produced  when  typing  the  cards.  Wilson5  describes  the  use 
of  a  Friden  Flexowriter  with  edge-punched  cards  instead  of  paper  tape  at  the  Atomic  Energy  Rcseatch  Establishment, 
Harwell,  to  produce  catalogue  cards,  the  monlhlv  library  accessions  list  and  lists  of  unclassified  UK  Atomic  Energy 
Authority  reports  from  one  master  typing,  with  a  saving  in  clerical  and  typing  effort  estimated  at  1000  man  hours  a  year. 
Vickers*  describes  the  use  of  Vonamatic  tape  typewriters  at  the  UK  Central  Electricity  Generating  Board  Information 
Service  *o  produce  library  catalogue  cards,  reports  lists  and  a  library  bulletin,  with  considerable  saving  in  typing  and 
clerical  effort. 

The  main  disadvantage  with  a  manually  produced  bulletin  is  the  difficulty  of  providing  indexes,  particularly  subject 
indexes,  which  are  the  most  important  if  the  bulletin  is  to  be  used  as  a  retrospective  search  tool.  SMra  cards  must  be 
written  or  typed  and  sorted  into  order  and  the  final  index  retyped.  For  production  of  indexed  bulletins  some  measure 
of  computer  assistance  is  essential. 


2.2.2  Computer-Produced  Bulletins 

Data  preparation  for  computer  processing  and  computers  in  Information  Centres  are  dealt  with  in  detai’  in  Sections 
4  and  5  of  this  Manual.  Only  an  outline  of  some  actual  systems  used  for  bul'etin  production  is  therefore  given  here. 
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A  TYPICAL  ABSTRACT 

BIBLIOGRAPHIC.  INFORMATION,  ABSTRACT,  AND  INDEXING  TERMS 


Corporate  Author  B  &D  Abstracts,  Volume  &  Issue 

7  \ 

TRC  Accession  Number/  Corporate  Author's  Reference  Number  \ 


T78-0378  /  WSL-LR-296-AP  3902 

Warren  Spring  Lab, ,  Stevenage,  Herts. ,  UK 
ENVIRONMENTAL  STUDY  OF  AN  ACTIVATED  CARBON  PLANT 
Clayton,  P.  WallLn,  S.C.  1978  14pp  2ref 

TRC 

A  summary  of  the  environmental  aspects  arising  from  the  production 
of  activated  carbon.  The  methods  for  pollution  control  are  specified  and 
the  emissions  and  predicted  ambient  concentrations  given.  Recommen¬ 
dations  are  made  to  achieve  satisfactory  operation  of  pollution  control 
equipment  for  normal  and  malfunction  conditions. 

—  ^Activated  carbon/*Pollutlon/*Flue  gases/Actlvated  carbon  treatment 
/Air  pollution/Health/Fumes/Dust/Odors/+Kiln  gases/HG/13B/68A/71C/ 


Availability  of  reports 


Indexing  terms 


Most  reports  ate  available  from  TRC.  Ask 
for  order  forms  TR  -100.  ( Price  List  TR  406; 

Rules  of  supply  TR  -102). 

Other  reports  are  available  from  the  Corporate 
Author  (full  address  given  with  each  abstract) 
or  HMSQ  for  which  requests  should  be  sent 
to;  HMSO.  PO  Box  569, 

London,  SE1  9NH. 


•  E1C-TEST  thesaurus  terms  for  subject 
index  (•) 

•  other  EJC-TEST  terms 

•  non- EJC-TEST  terms  (+) 

•  Cosaii  code  used  for  location  in  R  &  D 
ABSTRACTS  (//) 

•  other  Cosati  and  NTIS  codes  (/) 


®  Crown  Copyright  1979 

Extracts  may  be  freely  published 
providing  the  source  is  acknowledged. 


Fig.l  Guide  to  lay-out  of  Technology  Reports  Centre’s  R  &  l>  Abstracts 


R 

INFORMATION  SERVICES  IN  THE  MARKET  PLACE 
3irks,  C.I. 

*  British  Library  Research  S  Devel:p=eni  Report 

*  BLRDR  5430  (BLLD  2113.5t'F) 

Available  only  froa  B.L.b.D. 

July  1978  pp.75  E3.00 


Fig.2  Typical  entry’  in  BLLD  Announcement  Bulletin 


When  a  bulletin  is  produced  by  computer  the  bibliographical  data  must  be  prepared  in  machine  readable  form.  Tape 
typewriters  are  frequently  used  for  this  purpose.  Operational  systems  using  tape  typewriters  have  included  the  UK 
Technology  Reports  Centre's  original  system  for  R  &  D  Abstracts  (Schuler7  /  and  the  French  CNRS  system  used  for 
preparing  Bulletin  Signaleiiquc  (d’Olier  and  Dusoulier8).  Other  methods  for  computer  input  include  Optical  Character 
Recognition  (OCR)  used  (1974)  in  the  US  Department  of  Navy  Ship  Systems  and  Scientific  Documentation  Division 
(Smith9)  and  direct  entry  into  the  computer  from  an  on-line  terminal  using  a  visual  L  splay  unit,  as  is  now  used  for  R  &  D 
Abstracts  (Adams10).  Advantages  and  disadvantages  of  each  of  these  methods  have  been  summarised  by  Mclvor". 
Development  of  minicomputers  has  led  to  the  possibility  of  smaller  Centres  having  their  own  computer,  and  bulletin 
production  is  one  of  the  tasks  very  suitable  for  running  on  such  machines.  The  preparation  of  the  UK  Defence  Research 
Information  Centre’s  abstract  bulletin  Defence  Research  Abstracts  is  now  carried  out  completely  in-house  using  a  GEC 
4080,  the  processes  bemg  described  in  detail  by  Hart 15  and  in  Canada  the  Defence  Scientific  Information  Service  prepares 
Documents  Digest  using  a  Varian  620i  (Mclvor1).  The  ultimate  in  computer  production  of  bulletins  is  computer  type¬ 
setting,  described  in  Section  5  of  this  Manual.  At  present  this  is  only  economic  for  larger  organisations  such  as  NTIS  and 
NASA.  Figures  3  and  4  show  typical  entries  in  the  NTIS  Government  Reports  Announcements  and  the  NASA  STAR 
(Scientific  and  Technical  Aerospace  Reports). 


AO-AOSS  211/7GA  PC  A02/MF  AOt 

Advisory  Grout/  lor  Aerospace  Research  and 
Development  NoulUy-Sur-S*!no  (France) 
Technical  Evaluation  Report  on  tha  Meeting 
ol  tha  Propulsion  and  Energetics  Panal  (50th) 
A  Symposium  on  Hlgh-Tomparaluro  Problems 
In  Gss  Turbina  Engines. 

Advisory  ropt.. 

R.  Eggobrecht,  and  S.  Lombardo.  Mar  78. 20p 

Repl  no.AGARO-AR.1t6 

See  a'jo  Rest.  no.  AGAHD-CP-229.  AD-A052 

B<5. 

Descriptors:  'Gas  turbines.  -Military  research. 
Symposia,  High  tomperaluro,  Heat  transfer. 
Film  cooling,  Cooling.  Transpiration, 
Aerodynamics.  Combustors,  Combustion 
chambers.  Afterburners.  Heat  resistant  alloys. 
Thermal  insulation,  Coalings. 

Idontltiors:  NATO  furnished. 

This  Technical  Evaluallon  Report  on  the  50th 
PEP  Symposium  on  ‘Mlgh-Tomporaturo 
Problems  In  Gas  Turblno  Engines’  contains  a 
critical  survey  ot  the  37  papers  presonted. 
which  are  grouped  according  to  the  various 
subjects  covered  In  this  Symposium.  (Progress 
o(  New  Research  and  Development  Tost  Facili¬ 
ties,  Cooling  Techniques  and  Hoat  Transfor  In¬ 
vestigations,  Ettoct  o!  Turblno  Cooling  on 
Aerodynamic  Performance,  Combustors  and 
Afterburners,  High  Tomperaluro  Materials  and 
Coatings,  Overall  Englno  Design  and  Per¬ 
formance  Aspocls).  Conclusions  are  drawn  and 
aroas  with  high ’priority  (uturo  Intorost  are 
marked.  Tho  papers  presented  ot  the  meeting, 
together  with  the  dlacusslons,  are  published  as 
AGARO  Conloronco  Proceedings  CP-229  ’High- 
Temperature  Problems  In  Gas  Turblno  En¬ 
gines  (Author) 


Fig.3  Typical  entry  in  Government  Reports  Announcements 


7.2 .?  Bulletin  Indexes 

A  manually  produced  bulletin,  as  stated  earlier,  is  unlikely  to  include  indexes  because  of  the  large  amour-'  of  labour 
involved.  Computer-produced  bulletins  can  be  indexed  according  vo  any  of  the  data  fields  chosen.  The  most  important 
index  for  retrospective  search  is  the  subject  index,  but  the  Centre  will  receive  requests  for  documents  not  only  by  subject. 
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N77-32177#  Advisory  Group  tor  Aarospaco  Research  and 
Dovelopmem,  Pans  (France). 

A  FURTHER  REVIEW  OF  CURRENT  RESEARCH  RELATED 
TO  THE  DESIGN  AND  OPERATION  OF  LARGE  WIND 
TUNNELS 

Aug.  1977  (30  p  tefs 

(AGAR0-AR.105)  Avail-  NTIS  HC  A07/MF  A01 

Research  related  to  windtunnel  design  and  operation  is 
presented  Current  results  and  planned  effort  for  346  studies 
and  research  investigations  underway  in  nine  countries  are 
reviewed  and  commented  upon  The  work  of  the  Wind  Tunnel 
Test  Techniques  (TESI  Subcommittee  is  described  and  tha  rationale 
for  the  effort  is  given.  Main  conclusions  and'iecommendations 
are  sumroarited.  and  lists  ol  titles,  investigators’  names  and 
locations  for  tiro  research  and  studies  reviewed  are  included. 

Author 


Fig.4  Typical  entry  in  STAR 


but  by  author,  title,  document  number,  corporate  author,  contract  number,  etc.  Printed  indexes  for  each  of  these  data 
elements  will  provide  a  search  facility  for  the  Centre’s  clerical  staff  who  deal  with  requests  fo  specific  documents,  and 
allow  the  Centre  to  dispense  with  a  number  of  separate  card  catalogues  with  consequent  savir  g  in  the  time  taken  for  card 
filing.  Except  for  the  subject  index  the  indexes  can  be  produced  using  simple  computer  sort  programs.  Consistency  of 
recording  is  important  and  the  rules  given  in  Section  2  of  the  Manual  must  be  followed  to  ensure  standardized  entries  in 
the  indexes. 

The  subject  index  is  not  so  straightforward.  It  will  depend  on  the  indexing  system  used  in  the  bulletin.  If  the  UDC 
is  employed,  it  may  appear  to  be  difficult  to  sort  by  computer,  but  programs  have  been  developed  for  the  purpose  by  the 
Zentralstelle  fur  maschinellc  Dokumcntation,  Frankfurt  am  Main  and  described  by  Schneider  and  Koch14  Subject 
indexes  based  on  descriptors  can  be  organised  in  different  ways.  Single  descriptors  (single  or  multiple  term  concepts') 
can  be  identified  separately  to  the  computer  in  the  data  preparation  (set  Manual,  Section  4)  and  each  is  printed  out  in 
turn  associated  with  the  document  title  and,  if  desired,  the  full  bibliographical  reference.  This  method  lias  been 
developed  at  the  UK  Aircraft  Research  Association  Ltd  and  is  described  by  Barncttis.  Complex  subject  headings  can  be 
obtained  by  associating  two  descriptors.  The  main  heading  is  identified  by  marking  the  relevant  descriptor  with  an 
asterisk,  and  the  qualifying  descriptor  is  separated  from  it  by  a  comma.  The  association  is  made  intellectually  at  the 
editorial  stage.  This  method,  used  in  the  NTIS  Government  Reports  Announcements  is  also  employed  at  the  Defence 
Research  Information  Centre  and  is  illustrated  in  Figure  5  which  shows  one  item  from  Defence  Research  Abstracts  and 
Figure  (i  which  shows  the  position  of  this  item  in  the  subject  index. 

Mention  should  also  be  made  of  a  form  of  subject  indexing  based  on  manipulating  document  titles,  KWIC  (KeyWord 
In  Context)  and  KWOC  (KeyWord  Out  of  Context).  There  are  several  computer  programs  available  for  preparing  KWIC 
and  KWOC  indexes.  A  list  of  unimportant  words  is  made  -  the  ‘slop  list’.  All  other  title  words  are  selected  in  turn  by 
the  computer  and  cither  printed  centrally  down  the  page  in  alphabetical  order,  with  the  remaining  title  words  in  order 
on  each  side  (KWIC),  or  printed  at  the  side  of  the  page  (KWOC)  with  the  whole  title  repeated  to  the  right,  Fach  entry 
is  associated  with  an  identification  code.  The  usefulness  of  KWIC  and  KWOC  indexes  depends,  of  course,  on  document 
titles  being  informative. 

It  is  not  necessary  for  every  issue  of  the  bulletin  to  contain  a  complete  set  of  indexes.  A  useful  minimum  is  subject, 
personal  author,  and  an  index  associating  the  original  document  number  witli  the  Centre’s  accession  number  A  complete 
set  of  indexes  should  be  issued  as  a  separate  volume  or  volumes,  preferably  at  quarterly  intervals  with  an  annual  cumula¬ 
tion.  These  index  volumes  will  become  the  main  search  tools  for  the  Centre  for  identifying  individual  documents  and  for 
conducting  subject  searches. 

2.3  Staff  Requirements 

For  a  service  where  intellectual  scanning  of  journals  is  required,  but  abstracts  are  not  included,  one  information 
scientist  can  select,  index  and  check  about  300  citations  per  week  if  working  full-time  on  the  task2.  When  abstracts  arc 
included,  a  reasonable  time  for  each  abstract  with  indexing  terms  is  20  30  minutes,  giving  a  weekly  output  of  about 
80  abstracts.  If  a  large  proportion  of  author  abstracts  can  be  used,  100  abstracts  per  week  should  be  expected  The 
editorial  staff  can  be  expected  to  edit  300-400  items  per  week  each,  working  full-time.  Such  staff  would,  however, 
normally  have  additional  duties  related  to  the  Centre’s  other  publications. 


3.  SELECTIVE  DISSEMINATION  OF  INFORMATION  (SD1) 

As  its  name  implies,  SDI  is  a  service  in  which  information  is  tailored  to  user  needs.  Development  of  SD1  services 
has  increased  rapidly  with  the  increase  in  computer  processing,  although  manual  systems  are  successfully  operated  Both 
manual  and  computer-based  SDI  will  be  considered,  with  the  emphasis  on  the  latter. 


1  AERONAUTICS 


1-2  AERONAU,  CS 


(2001-77011 

BR-5«759  RAE-TR-76123  UNLIMITEO 

Roys!  Aircraft  Est • #Farnborouah, Hants. /UK 
PRECISION  APPROACH  PATH  INDICATOR  -  PAPI 

Smi th/ A , J,  Johnson#D,  9,1976  25op  5ref 

FS-39 

MP  AVAIL 

♦  Landing  aids#PAPI ( l andi ng  «id)/*Guide  path  landing  syste«s#PAPI ( landing  aid)/ 

•PAPI (1 anding  aid) , Ai rcraf t  1  and! ng/Approach  lights/ 

The  Precision  Approach  Path  Indicator  (PAPI)  Is  a  staple  visual  aid  that  has  been 
developed  to  assist  pilots  during  their  approach  to  landing.  It  enables  pilots  to 
acquire  the  correct  glldeslope  and  subsequently  to  maintain  their  position  on  it#  thus 
ensuring  an  accurate  approach  and  landing.  Descriptions  of  two  existing  systems#  VASI 
and.T-VASI#  ere  included  together  with  a  brief  description  of  PAPI,  The  operational 
requirements#  both  current  and  future#  of  such  systems  ore  discussed#  and  it  is  shown 
how  the  PAPI  system  test  meets  these  needs,  HJB 


Fig. 5  Extract  from  Defence  Research  Abstracts  (DRA) 


SUBJECT  INDEX 


Subject  Heading  &  Title  of  Report 


Access  Ho  Location 


LANCE(MISSILE) 

FLIGHT  TESTS 

Preliminary  Date  Report  (OulcL  Look)  Loose  Flight  Teats  No  276  Mlaalle  3667  Mlsalon  NMFI-1 
Preliminary  Data  Report  (Quick  Look)  Lance  Flight  Teats  No  279  Mlaalle  307S  MUsion  ASP  60 

LANDING 

APPROACH 

Technloue  (or  Lending  From  Stcop  Gradient  Approaches  Using  a  Medium  Slto  Military  Tranaoor 

t  LANOING  AI0S 

PAPI (LANDING  AID) 

Precision  Approach  Path  Indicator  -  PAPI 

LAP  J0INI3 

ADHESIVE  BONDING 

Influence  of  Various  Pretreatments  on  Lepjoint  Strength 

Primary  Adhaaively  Bonded  Structure  Technology  Teat  Report  General  Materiel  Property  Date 
MECHANICAL  PROPERTIES 

_ Primary  Adhesively  Bonded  Structure  Technology  Test  Report  Genera)  Property  Data _ 


P-2ISS52  21 70*7701 
P-215690  217S-7701 


8R-97589  2001-7702 


8R-50759  2001-7701 


P-217389  2053-7705 
P-216086  2071-7702 


P-216089  2069-7702 


Fig.6  Extract  from  Cumulative  Subject  Index  to  DRA 


9S5W!*3Gwq<«*W«?? 
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3.1  The  SD1  Profile 

The  interests  of  the  user  must  be  translated  into  a  form  which  can  be  used  by  the  Information  Centre  iO  match 
against  the  material  available  so  that  only  what  is  relevant  is  supplied.  This  is  the  user’s  ‘profile’.  When  SDI  is  to  be 
supplied  to  an  individual  scientist  the  first  step  in  compiling  his  profile  is  to  obtain-a  statement  from  him  of  his  interests 
in  his  own  words  This  is  facilitated  if  a  standard  form  is  used  which  siiouid  also  include  space  for  him  to  suggest  key¬ 
words  and  to  include  references  to  known  relevant  documents.  If  the  latter  have  been  supplied  by  the  Centre,  the 
indexing  terms  assigned  to  them  together  witi.  the  keywords  translated  into  the  form  used  in  the  Centre’s  indexing 
system  will  give  a  preliminary  list  of  te  j  from  which  the  profile  can  be  built  up.  For  manual  systems  a  fairly  simple 
profile  is  sufficient  Computer  processing  allows  of  more  sophistication,  and  a  formal  example  of  using  descriptors  from 
TEST  for  profile  construction  will  be  given  in  paragraph  3.4.2. 

3.2  Definitions 

Since  SDI,  especially  when  using  a  computer,  involves  information  retrit.al,  definitions  of  a  few- commonly  used 
terms  are  appropriate.  These  are  natural  language  and  controlled  vocabulary  indexing,  and  precision  and  recall  in 
retrieved  information.  In  natural  language  (free  text)  indexing,  the  indexing  terms  assigned  are  taken  from  the  original 
text  of  the  document.  In  controlled  vocabulary  indexing  the  indexing  teams  are  taken  from  a  fixed  list  of  terms  or 
hesaurus  such  as  TEST.  In  assessment  of  retrieved  output  precision  is  the  ratio 

number  of  relevant  items  retrieved 

- - -  - - - - —  x  100 

total  number  of  items  retrieved 

recall  is  the  ratio 

number  of  relevant  items  retrieved 

- - - - - - — - - ; - x  100 

total  number  of  relevant  items 

3.3  Manual  Selective  Dissemination  of  Information 

New  documents  received  in  an  Information  centre  can  be  given  an  initial  distribution  to  those  most  interested  in 
the  subject  if  a  register  of  customers’  interests  is  maintained.  Such  a  register,  often  called  a  Field  of  Interest  Register, 
normally  consists  of  subject  headings  followed  by  abbreviations  or  codes  indicating  the  organisations  or  individuals 
interested  in  the  subject.  It  can  be  organised  in  different  ways  according  to  the  method  of  subject  indexing  used  in 
the  Centre.  A  centre  using  TEST  will  probably  find  ordering  by  COSATI  subject  categories  the  most  convenient,  but 
numerical  codes,  e.g.  the  Universal  Decimal  Classification,  or  alphabetical  subject  headings,  have  also  been  used.  An 
example  of  an  Interest  Register  using  alphabetical  subject  headings  is  given  by  Wright16  who  describes  the  Royal  Aircraft 
Establishment  Library’s  system.  It  consists  of  subject  headings  in  alphabetical  order  with  sub-headings  followed  by  codes 
showing  Depaitments  and  Divisions  within  Departments.  It  was  originally  prepared  from  a  master  index  maintained  on 
strip  index  panels  from  which  new  editions  were  produced  by  photography  and  offset  printing,  but  its  production  has 
now  been  computerised. 

The  increasing  use  of  microfiche  enables  new  documents  to  be  selectively  distributed  in  this  form,  with  consequent 
savings  in  waiting  time  and  costs  for  postage  and  packing.  A  good  example  of  such  a  service  is  that  provided  by  NTIS  in 
SRIM  (Selected  Research  in  Microfiche),  in  which  an  automatic  distribution  takes  place  every  two  weeks  of  microfiche 
reports  in  subject  areas  selected  by  the  customer. 

Manual  SDI  systems  in  which  information  about  document,  m  ,r  than  the  documents  themselves  is  distributed 
are  also  feasible,  although  in  many  cases  these  are  very  similar  to  a  Centre’s  literature  bulletins  already  discussed  in 
paragraph  2  1 .  Selection  of  items  is  often  bused  on  simple  profiles  constructed  for  groups  of  users  interested  m  inter¬ 
disciplinary  subjects.  Roysdon  and  Mistichelli17  describe  a  system  developed  at  Lehigh  University  for  the  interdisciplinary 
problems  of  Energy,  Food,  and  Manpower/Womanpower.  Abstracts  on  these  subjects  are  copied  from  abstracting 
journals  and  primary  sources  by  library  staff  to  produce  monthly  continuing  bibliographies.  Circulation  is  to  50-80 
recipients  and  the  system  is  claimed  to  be  cost  effective  and  to  have  stimulated  the  use  of  library  resources.  Another 
manual  system  which  involves  building  up  individual  profiles  has  been  developed  at  the  University  of  Aston,  Birmingham, 
UK,  and  is  described  by  Vincent  and  Seals18.  The  service  is  based  mainly  on  Current  Contents  and  other  abstracting 
journals  and  serves  ten  groups  of  staff  and  research  students  in  Pharmacy,  Applied  Psychology,  Chemical  Engineering, 
Management,  and  Production  Engineering.  References  are  copied  on  to  cards,  each  card  marked  with  profile  code  and 
library  call  number  and  a  typed  list  prepared  from  the  cards.  Xerox  copies  of  the  lists  are  sent  to  the  users. 

The  Scientific  Documentation  Centre,  Ltd  at  Dunfermline,  Scotland,  operates  a  commercial  manual  SDI  service. 

The  Director,  Dr  P.S. Davison19  describes  the  service  as  covering  a  broad  subject  field  including  spectroscopy,  analytical 
chemistry,  pollution,  computer  science,  human  and  information  sciences  and  librarianship.  This  involves  scanning  by 
part-time  scientific  staff  annually  3250  journals,  UK  and  US  openly  available  government  reports,  UK  and  US  thesis 
titles  and  14000  book  titles.  A  classification  system  devised  by  the  Centre  is  used  (Dr.vidson20).  Output  is  provided 
in  the  form  of  125  x  75  mm  cards  distributed  weekly,  containing  bibliographical  details  but  not  abstracts. 
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3.4  Computer-based  Sl)l 

Information  retrieval  using  computer  systems  is  dealt  with  in  detail  in  Section  7,  Vol.IIi  of  this  Manual.  It  is 
sufficient  to  mention  here  that  an  SOI  computer  progiam  will  usually  be  very  similar  to  the  information  retrieval  program 
used  by  the  Centre  with  possible  modifications  to  the  print  program.  The  data  base  searched  will  normally  be  the  latest 
update  to  the  main  data  base,  containing  the  input  from  one  or  two  issues  of  the  Centre's  announcement  bulletin,  but  it 
is  possible  to  obtain  other  data  bases  by  purchase  or  lease  to  enhance  the  service,  and  this  possibility  will  be  discussed  in 
paragraph  3  1  A  monthly  SOI  service  is  frequently  provided.  A  service  at  longer  intervals  defeats  the  objective  of 
immediacy,  and  unless  the  Centre  has  a  very  large  input,  or  uses  externally-produced  tapes  to  supplement  the  service,  a 
fortnightly  SOI  will  produce  only  a  few  references. 

3  4. 1  Compiling  Pro files  for  the  Computer 

For  computer  searching  the  search  profile  derived  from  the  user's  interest  statement  must  be  an  ordered  list  of  terms 
in  a  form  strictly  compatible  with  the  indexing  language  of  the  data  base  to  be  searched  and  the  search  strategy  used  by 
the  computer  scarcli  program.  The  terms  (descriptors)  are  sometimes  all  single  words,  as  in  many  natural  language 
systems  or,  as  in  TFST,  single  words  and  words  combined  to  form  concepts.  Individual  descriptors  must,  however,  be 
coordinated  to  match  the  more  complicated  concepts  of  the  user’s  interest  statement.  The  method  of  coordination 
frequently  used  is  the  employment  of  Boolean  logic  operator-  There  are  three  of  these  operators  commonly  used,  viz.- 


Operator 

Symbol 

Example  of  use 

Meaning 

AND 

* 

(1  *2) 

Select  only  documents  that  are  indexed 
under  both  term  1  and  term  2 

OR 

+ 

(1  +2) 

Select  documents  indexed  under 
cither  term  1  or  term  2 

NOT 

(1  ~2) 

Select  document  indexed  under 
term  1  but  discard  any  indexed  also 
under  term  2 

The  processes  can  also  be  shown  in  the  form  of  Venn  diagrams  as  illustrated  in  Figure  7  where  the  circles  represent 
sets  of  documents  indexed  by  descriptors  A  and  B  respectively.  The  shaded  areas  represent  (a)  documents  indexed  both 
by  A  and  by  B,  i.c.  A  *  B,  (b)  documents  indexed  cither  by  A  or  by  B,  i.e.  A  +  B,  (c)  documents  indexed  by  A  but  not 
by  B,  i.c.  A  -  B. 

3.4.2  Example  of  Profile  Compilation  using  TEST  Descriptors 

As  an  example  of  profile  compilation  using  controlled  language,  TFST  descriptors  in  this  case,  the  following  steps 
can  be  recommended.  Information  is  received  from  the  user  as  follows: - 

Statement  of  interest  Use  of  carbon  or  glass  fibre  composite  materials  in  aircraft  structures 

Relevant  do  'ument(s)  Impact  of  composite  materials  on  aerospace  vehicles  and  propulsion  systems. 

AGARD  Structures  and  Materials  Panel  and  Propulsion  and  Energetics  Panel 
Symposium.  September  1972,  AGARD-CP-1 12. 

AG  ARP  CP  1 1 2  consists  of  a  number  of  papers,  and  referring  to  the  original  indexing  of  these  papers  v  lien  processed  in 
the  Centre  it  is  found  that  the  following  descriptors  which  appear  relevant  to  the  profile  have  been  assigned. 

Terms  relevant  to  aircraft  structures 

Aircraft  panels 

Airframes 

Terms  relevant  to  composite  materials 
Composite  fabrication 
Composite  materials 
Composite  structures 
Fiber  composites 
Sandwich  structures 

Terms  relevant  to  carbon  or  glass  fibres 
Carbon  fibers 

Fiberglass  reinforced  plastics 

(It  should  he  noted  that  the  TEST  spelling  of  descriptors  is  followed.)  To  make  the  profit-  more  complete  it  is 
necessary  to  refer  to  these  terms  in  TEST,  and  consider  for  selection  some  of  the  broader,  narrower  and  related  terms 
which  arc  displayed.  This  results  in  selection  <T  further  terms  as  follows: 


Ill 


A  AND  11  (A  *  m 


A  OR  1)  (A  +  m 


A  NOT  I)  (A  in 


Fig.?  Venn  diagrams 


Term  relating  to  airemjt  structures 

Control  surfaces 

Fuselages 

Landing  gear 

Tail  assemblies 

Flaps  (control  surfaces') 

Tv*tus  relating  to  composite  materials 
Chopped  fiber  composites 
Oriented  fiber  composites 

Terms  relating  to  carbon  or  glass  fibres 
Fiberglass  reinforced  plastics 


'Fiberglass  reinforced  plastics'  defines  a  concept  of  a  composite  materia!  using  glass  fibre.  The  same  concept  can  be 
achieved  by  combining  'glass  fibers'  with  the  descriptor*  relating  to  composite  materials  using  the  A  NO  operator.  Using 
AND  and  OR  operators  the  profile  is  built  up  line  by  line  as  follows: 


PROFILE 

Line  No.  Term  or  logic  equation  Notes 


1 

2 
2 

4 

5 

(i 

? 

S 

0 


Aircraft  panels 

Airframes 

Control  surfaces 

Fuselages 

Landing  gear 

Tail  assemblies 

Flaps  (control  surfaces) 

Fins 

I  8/+ 


Complete  descriptor 
Complete  descriptor 
Complete  descriptor 
Complete  descriptor 
Complete  descriptor 
Complete  descriptor 
Complete  descriptor 
Complete  descriptor 
I  OR  2  OR  3  OR  . . , 


OR  8 
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PROFILE 


Line  No. 

Term  or  logic  equation 

Notes 

10 

Carbon  fibers 

Complete  descriptor 

11 

Glass  fibers 

Complete  descriptor 

12 

10+11 

10  OR  1 1 

13 

Sandwich  structures 

Complete  descriptor- 

14 

Composite  structures 

Complete  descriptor 

15 

Composite  fabrication 

Complete  descriptor 

16 

Composite  materials 

Complete  descriptor 

17 

Fiber  composites 

Complete  descriptor 

18 

Chopped  fiber  composites 

Complete  descriptor 

19 

Oriented  fiber  composites 

Complete  descriptor 

20 

13-19/+ 

13  OR  14  OR  IS  OR... OR  19 

21 

12*  20 

12  AND  20 

22 

Fiberglass  reinforced  plastics 

Complete  descriptor 

23 

21  +  22 

21  OR  22 

24 

9  *  23  -  PRINT 

The  final  line  of  the  profile,  24,  results  in  a  printout  of  references  meeting  the  requirements  of  the  ‘interest 
statement’. 

Additional  search  facilities  arc  usefully  incorporated  in  an  SD1  computer  program  provided  the  data  base  is  suitably 
structured  so  that  data  elements  required  to  be  searched  arc  identified,  Such  additions  include  author  names  and 
COSATI  fields.  Each  of  these  elements  would  be  added  to  the  profile  and  treated  in  tin  same  way  as  the  descriptors  in 
the  logic  equations. 

Economy  in  profile  compilation  can  be  achieved  by  truncation.  A  right  hand  truncation,  represented  often  by  a 
question  mark,  is  exemplified  by  ‘comput?’.  Such  a  descriptor  input  would  involve  a  search  on  all  terms  beginning 
‘comput’  which  in  TEST  arc: 

Computation 
Computational  linguistics 
Computer  components 
Compuicr  driven  punches 
Computerized  simulation 
Computer  logic 
Computer  personnel 
Computer  programming 
Computer  programs 
Computers 

Computer  storage  devices 
Computer  systems  hardware 
Computer  systems  programs 
Computer  gun  sights 

lly  a  similar  process  left  hand  truncation  is  represented  by  "? planes',  which  would  bring  out  'Airplanes’,  'Rocket 
planes’,  'Seaplanes'.  The  facility  has  to  be  used  with  caution  as  false  drops  can  be  obtained.  In  the  'comput'?’  example 
given  above,  for  instance,  'computing  gun  sights’  would  be  irrelevant  in  a  profile  designed  to  provide  SOI  on  application 
of  computers  in  information  retrieval. 

In  controlled  vocabulary  indexing  systems  economy  in  profile  construction  is  also  obtained  where  the  computer 
program  used  at  the  data  input  stage  automatically  posts  documents  indexed  under  specific,  narrower  terms  to  the 
broader,  generic  terms.  In  such  systems  the  profile  can  consist  of  broader  terms  only,  as  the  use  of  these  terms  will  also 
retrieve  the  items  indexed  under  the  associated  narrower  terms.  In  the  example  profile  given  above,  'chopped  fiber 
composites'  and  'oriented  fiber  composites’  arc  narrower  terms  both  referred  to  ‘fiber  composites’  as  the  broader  term, 

If  the  'posting  up’  procedure  is  followed  at  the  input  stage  use  of  ‘fiber  composites'  in  the  profile  will  retrieve  documents 
indexed  under  ‘chopped  fiber  composites’  or  ’oriented  fiber  composites’,  and  these  narrower  terms  can  be  omitted  from 
the  profile.  The  procedure  will  generate  some  ‘noise’,  since  documents  indexed  under  any  narrower  term  to  a  given 
broader  term  will  be  retrieved,  and  in  some  cases  only  a  proportion  of  the  narrower  tr.-ns  will  be  relevant  to  the  profile. 

3,4.3  Profile  Compilation  using  Natural  Language 

A  natural  language  profile  can  need  more  intellectual  effort  than  a  controlled  vocabulary  profile,  as  it  is  necessary 
to  include  all  variations  of  a  desired  term  in  the  profile.  These  variations  include  spelling,  e  g,  color/colour,  computerise/ 
computerize,  sulphate/sulfatc;  and  synonyms,  e.g.  columbium/niobium.  For  chemical  subjects  all  alternative  names  for 
relevant  ehcmical  compounds  must  be  included,  A  preliminary  study  of  the  vocabulary  used  in  relevant  documents  is 
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lielpful  in  identifying  synonyms.  The  profile  usually  consists  of  single  terms,  concepts  being  built  up  using  the  Boolean 
AND  operator.  Truncation  is  frequently  employed,  and  other  data  fields,  such  as  author  names  and  classification  codes 
are  often  included  in  the  profiles.  The  main  stages  in  profile  construction  are  normally  similar  to  the  example  ‘TEST’ 
profile  given  in  paragraph  3.4.2. 

3.4.4  Ollier  Methods  of  Profile  Construction 

Although  profiles  arc  usually  built  up  using  words,  it  is  possible  to  use  numerical  codes.  Thus  Brophy21  describes 
a  system  developed  at  Tccside  Polytechnic  for  informing  library  users  of  new  acquisitions  within  their  subject  interests. 

The  profiles  are  built  up  using  Dewey  Decimal  Classification  numbers. 

The  search  strategy  need  not  necessarily  involve  Boolean  logic.  The  terms  in  the  profile  can  be  given  weights 
(e.g.  from  I  to  10)  according  to  their  relative  importance  to  the  search.  The  weights  of  all  terms  in  the  profile  which 
match  with  terms  in  document  records  in  the  data  base  are  summed  to  produce  a  document  score.  All  document 
citations  scoring  above  a  given  threshold  level  are  printed  out  as  the  SD1  output. 

Evans52  reports  research  on  various  search  strategics  as  part  of  the  1NSPEC  programme.  Strategies  ranged  from 
simple  coordinate  matching  of  terms  and  groups  of  terms  (concepts)  to  the  use  of  term  weights  and/or  group  weights, 
and  Boolean  logic  with  and  without  weights.  The  best  retrieval  performances  were  given  by  strategies  using  weighting 
techniques  and  the  most  cost  effective  strategy  overall  was  one  which  used  the  coordinate  matching  of  a  restricted  list  of 
terms  with  weights. 

Farradanc’s  system  of  relational  indexing  has  been  used  experimentally  for  profile  construction  (Yatcs-Mercer23). 
Forty-three  profiles  were  built  up  of  concepts  or  groups  of  ■'oncepts  linked  by  relational  operators.  Tested  against  a  data 
base  of  2S29  abstracts  from  Metals  Abstracts,  the  profiles  gave  an  output  having  both  precision  and  recall  of  75'A 

3.4.5  Alt  town  tie  Profile  Construction 

Profile  construction  is  labour  intensive  and  uses  highly  trained  staff  It  is  tempting  to  ask  if  profiles  can  be  generated 
automatically  by  computer,  or  if  computer  assistance  can  cut  the  cost  of  intellectual  effort.  Some  experimental  work  on 
these  lines  has  been  reported,  although  results  to  date  do  not  lead  to  firm  conclusions.  Evans  and  Gould2'*  report  an 
investigation  using  the  INSPEC  thesaurus  in  which  lists  of  natural  language  terms  supplied  by  research  workers  were 
matched  against  the  thesaurus.  Three  further  profile  versions  were  derived  automatically  by  the  addition  in  turn  of 
(1)  synonyms  (2)  narrower  terms  and  (3)  selected  related  terms.  The  profiles  and  their  modifications  were  run  against 
two  document  collections.  The  automatically  generated  profiles  performed  slightly  worse  than  profiles  compiled 
subjectively  from  the  same  set  of  user  statements. 

The  United  Kingdom  Chemical  Information  Service  (UKCIS)  has  carried  out  investigations  using  the  free  text  file 
Chemical  Abstracts  Condensates  (CAC).  The  work  is  reported  in  two  papers  (Barker  et  al.  and  Robson  and  Longman)25*26, 
The  usefulness  of  a  potential  search  term  was  quantified  in  terms  of  its  specificity,  a  measure  of  the  frequency  of  its 
occurrence  in  relevant  material  against  its  frequency  of  occurrence  in  the  entire  data  base.  Computer  analysis  of  the  text 
of  relevant  items  from  a  fixed  file  of  three  issues  of  CAC  was  used  to  produce  lists  of  terms  in  order  of  specificity,  those 
above  a  given  specificity  value  being  added  to  the  profile  and  the  process  repeated  iteratively  until  no  new  relevant  items 
were  retrieved.  The  specificity  lists  were  converted  to  standard  search  profiles  which  were  compared  against  a  different 
fixed  file  of  CAC  with  conventionally  produced  profiles.  The  automatically  produced  profiles  were  cheaper  to  construct 
and  performed  similarly  to  the  conventional  profiles,  although  each  type  produced  relevant  items  which  the  other  missed. 

3.5  Output  Assessment 

When  the  results  of  the  first  computer  run  of  an  SDI  profile  are  submitted  to  the  user,  an  assessment  form  should 
accompany  the  printout  so  that  he  can  indicate  the  number  of  relevant,  possibly  relevant,  and  irrelevant  references. 

These  should  be  analysed  by  the  compiler  of  the  profile  to  investigate  the  reason  for  retrieval  of  irrelevant  references. 

The  data  base  searched  should  also  be  studied  to  find  out  if  any  relevant  reports  have  been  missed.  The  profile  is 
modified  if  necessary  and  an  assessment  made  of  the  modified  profile.  These  steps  should  be  repeated  until  the  number 
of  irrelevant  references  is  reduced  to  an  acceptable  level.  With  SDI  it  is  more  important  to  have  high  recall  than  high 
precision  in  the  response  to  the  profile.  A  profile  with  high  recall  is  likely  to  retrieve  items  which  the  user  would  not 
find  for  himself  but  which  he  will  find  of  interest.  A  high  precision  profile  might  only  retrieve  items  already  known  to 
him. 

3.6  User  Liaison 

For  the  best  results  from  an  SDI  service  the  users  should  be  encouraged  to  take  an  active  interest.  Information 
scientists  in  a  Centre  serving  a  local  community  such  as  a  research  establishment  can  be  in  frequent  touch  with  the 
scientists  receiving  the  SDI  output.  Such  contacts  can  be  on  an  informal  basis,  but  a  more  formal  approach  is  described 
by  Butterly2’  of  the  National  Nuclear  Research  Centre.  Pretoria.  South  Africa,  where  structured  interviews  with  users 
were  employed  in  constructing  and  improving  search  profiles  in  an  SDI  system  based  on  Atomindex  tapes  from  the 
International  Nuclear  Information  System  (INIS)  which  uses  a  controlled  language  thesaurus. 
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Where  an  Information  Cent/?  serves  a  scattered  community  of  users  personal  contacts  may  be  more  difficult.  In 
such  cases  it  is  important  to  compile  a  guide  to  the  system  which  is  circulated  to  potential  users.  An  example  of  such  a 
guide  is  that  published  for  the  UK  Department  of  Industry’s  Technology  Reports  Centre  by  Adams  ct  al.38.  This  gives 
details  of  TRC  indexing,  use  at  TEST  for  descriptors,  CCSATI  codes,  and  preparation  of  search  profiles  with  examples- 
TRC  profiles  are  based  on  the  TRC  data  base  and  N'TIS  and  INSPEC  magnetic  tapes.  Burton39,30  describes  the  system 
operated  by  the  US  Department  of  Agriculture's  Technical  Information  Systems  (formerly  the  National  Agricultural 
Library).  An  SDi  service  serves  over  1 600  scientists  from  nine  data  bases.  The  users  compile  and  modify  their  own 
profiles  with  the  aid  of  a  User’s  Guide  and  tutorial  seminars.  Another  method  of  assisting  users  in  large  scale  SDI  services 
is  employed  in  the  Canadian  CAN/SDl  service  run  by  the  Canada  Institute  for  Scientific  and  Technical  Information, 
where  local  librarians  and  information  scientists  are  trained  as  search  editors31. 

3.7  Using  Commercial  Data  Bases  for  In-house  SDI 

An  SDI  service-  based  on  the  Centre’s  own  input  can  be  supplemented  by  running  magnetic  tapes  of  commercial 
data  bases  on  the  Centre’s  own  computer.  These  tapes  can  be  obtained  by  purchase  or  more  usually  by  a  leasing  arrange¬ 
ment.  sometimes  with  an  additional  royalty  according  to  the  number  of  items  retrieved  during  an  SDI  run.  Some 
programming  effort  will  b;  required  in  the  Centre  to  reformat  the  tapes  to  make  them  compatible  with  the  Centre’s 
search  program,  and  SDI  profiles  may  have  to  be  rewritten.  A  profile  written  using  a  controlled  vocabulary  will  need 
considerable  modification  if  it  is  required  to  be  matched  against  a  free  text  file.  Some  examples  of  successful  operation 
will  show  how  the  problems  of  using  external  data  bases  have  been  overcome. 

In  Australia  the  Aeronautical  Researcii  Laboratories  introduced  a  computer-based  SDI  service  for  report  literature  in 
1  I ,  searching  the  data  bases  of  the  Australian  Defence  Science  and  Technology  Information  System  (ADSATIS).  and 

leasing  the  NTIS  magnetic  tape.  ADSATIS  acquisitions  were  already  indexed  using  TEST  descriptors.  On  the  NT1S 
tapes  not  all  descriptors  used  were  from  TEST,  and  a  retroactive  thesaurus  was  created  and  continuously  updated.  All 
descriptors  on  the  ADSATIS  tapes  were  searched,  as  were  all  asterisked  descriptors  on  the  NTIS  tapes.  A  simple  matching 
process  using  OR  logic  was  employed,  although  this  was  extended  later  by  using  the  facility  of  AND  with  any  number  cf 
COSATI  cateogrics  with  each  descriptor.  Output,  originally  in  the  form  of  fortnightly  individual  computer  printouts  was 
later  issued  on  continuous  line  printer  perforated  stationery,  each  item  displayed  being  addressed  to  the  recipient,  the  left 
card  of  «ach  pair  confining  full  bibliographical  details,  including  abstract  where  available,  and  the  right  hand  card  being 
preaddressed  to  the  library  for  loan  request  purposes.  Sheppard31 33  describes  the  development  of  the  system  for  a 
I’DIMO  computer,  six  man-weeks  of  analysis  and  programming  being  required  to  start  the  service.  After  a  few  years 
experience  a  questionnaire  completed  by  users  showed  that  96T  of  them  considered  the  se  vice  to  be  of  high  or  medium 
relative  importance  to  them  as  a  source  of  information. 

Commercial  data  bases  of  published  literature  are  also  used  by  many  Centres  to  provide  SDI  services  to  supplement 
or  replace  current  awareness  literature  lists  prepared  in  the  Centre.  The  library  of  the  Uv  Atomic  Weapons  Research 
Establishment  has  had  many  years  experience  in  providing  such  a  service.  The  development  of  the  AWRE  service  has 
been  described  by  Corbett34’35,36  and  Friend33  starting  with  the  Chemical  Abstracts  Service  Chemical  Titles  magnetic 
tape,  later  supplemented  by  INSPEC  and  Nuclear  Science  Abstracts  tapes.  The  computer  program  used  was  that 
developed  by  the  National  Research  Council  of  Canada  for  the  IBM  360/50  computer.  Profiles,  including  truncated 
terms,  are  matched  against  title  words.  Author  names  and  journal  titles  are  used  in  some  profiles,  and  subject  codes  also 
in  INSPEC  and  NSA  profiles.  Benefits  to  the  library  have  included  reducing  the  necessity  of  multicopy  purchasing  of 
expensive  primary  and  abstracting  journals,  and  users’  time  in  eover-to-oover  scanning  of  journals  has  been  reduced, 

3.8  Standard  Profiles 

The  production  of  personal  profiles  is  a  labour-intensive  process,  and  the  benefits  of  computer-based  SDI  can  be 
obtained  more  cheaply  by  producing  Standard  Profiles.  Standard  Profiles  are  compiled  by  information  scientists  on 
topics  of  interest  to  groups  rather  than  to  individuals.  Each  profile  is  normally  rather  broader  anil  simpler  than  a  personal 
profile  but  the  compilation  process  is  similar.  A  number  of  information  services  produce  both  personal  and  standard 
profiles. 

A  good  example  of  a  Standard  Profile  sendee  is  that  provided  commercially  by  tire  European  Space  Agency  Informa¬ 
tion  Retrieval  Service  in  its  Standard  Title  scries.  These  arc  issued  monthly  by  computer  search  of  the  NASA/I  A  A  data 
base.  The  titles  available  arc  shown  in  Appendx  1  (Information  from  ESA  IRS,  Via  Galileo-Galilei,  00044  Frascati. 

Italy).  In  the  United  States  the  NASA  SCAN  (Selected  Computer  Aerospace  Notices)  sendee  provides  a  similar  twice- 
monthly  pioduct  from  this  data  base.  Standard  profiles  based  on  the  INSPEC  data  base  are  the  INSPEC  weekly  Topics, 
covering  published  material  in  the  fields  of  optics,  quantum  optics  and  particle  optics,  condensed  matter  physics, 
geophysics,  devices  and  techniques  in  physics,  power  engineering,  communications,  electronics,  computers  and  computing, 
control.  (Information  from:  INSPEC,  The  Institution  of  Electrical  Engineers,  Savoy  Place.  London.  WC2  0BL.)  In 
France  the  Centre  de  Documentation  Scientifiquc  ct  Technique  offer  monthly  Standard  Profiles  from  the  PASCAL  data 
base.  The  profiles  are  compiled  by  information  scientists  in  the  Centre  and  will  eventually  cover  1000  titles.  The  output 
is  computer  printed  four  items  to  a  page  which  is  perforated  so  that  individual  items  can  be  filed  as  150  x  110  mm  cards. 
Eacli  item  gives  bibliographical  details  and  includes  an  abstract.  (Information  from  lnfonnascience,  Centre  de 
Documentation  Scicntifique  et  Technique,  Centre  de  la  Recherche  Scicntifique,  27  rue  Boyer,  75971  Paris  Cedcx  20.) 
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3.9  Supply  of  Documents  from  SDI  Printouts 

An  SDI  service  based  on  the  Information  Centre’s  own  data  base  will  provide  references  to  documents  held  in  the 
Centre,  and  the  SDI  printouts  can  incorporate  a  request  form  for  the  user  to  mark  up  and  return.  When  an  SDI  service 
also  incorporates  commercial  data  bases,  the  problem  becomes  complicated  and  the  Centre’s  staff  or  the  users’  local 
librarians  may  find  it  time-consuming  to  locate  the  full  text  of  some  of  the  items  cited  on  the  commercial  tapes.  A 
partial  solution  to  this  problem  is  suggested  by  Bourne38  at  the  University  of  California  which  has  a  geographically 
dispersed  library  system  and  runs  an  SDI  service  from  tapes  including  CA  Condensates,  B10S1S  Previews,  ERIC,  and 
CAIN  (AGRICOLA).  A  computer  file  was  built  up  of  journal  title/location/call  numbers.  Output  citation  tapes  from 
each  SDI  profile  were  run  against  the  journal  file  as  part  of  the  output  printing  run  and  the  library  location  was  printed 
out  after  the  citation.  Such  a  system  could  be  adapted  to  any  Centre’s  needs  if  a  union  catalogue  exists  of  journals  held 
by  libraries  which  cooperate  in  an  inter-lending  scheme  with  the  Centre  operating  the  SDI  sendee. 

3.10  Costs  and  Staffing  Requirements  of  SDI 

It  will  be  clear  from  the  preceding  paragraphs  that  there  are  many  factors  affecting  the  cost  of  an  SDI  sendee. 
Whether  external  tapes  are  purchased  or  leased,  or  whether  the  sendee  is  provided  solely  from  the  Centre’s  own  data 
base,  the  amount  of  work  in  profile  construction  expected  from  the  user,  and  whether  personal  or  standard  profiles 
or  both  arc  provided  are  some  of  the  main  considerations.  Guidance  on  the  subject  can  best  be  given  by  indicating  some 
published  surveys  which  should  be  consulted  fer  further  details. 

Information  scientists’  time  in  profile  construction  has  been  discussed  by  Evans”  in  the  paper  already  cited  in 
paragraph  3.4.4.  The  method  used  for  profile  construction  has  a  considerable  influence  on  the  time  taken.  Compiling 
and  modifying  profiles  using  coordinate  term  matching  averaged  90  minutes  per  profile.  Using  Boolean  logic  with 
weights  the  time  was  168  minutes.  Translated  into  yearly  output  an  information  scientist  could  handle  about  1000 
profiles  using  coordinate  matching,  or  about  550  using  Boolean  logic  with  weights.  Evans  also  found  that  profiles  using 
a  controlled  vocabulary  were  compiled  more  quickly  than  those  using  free  text,  llisinger39  in  describing  the  service 
provided  by  the  National  Technological  Library  of  Denmark  on  profiles  run  against  the  COMPHNDEX  and  IN  SPEC 
magnetic  tapes  estimates  that  profile  construction  takes  2  4  hours  of  an  information  scientist’s  time,  with  user 
cooperation. 

Using  external  data  bases  introduces  an  extra  cost.  Kabi40  of  the  United  Kingdom  Chemical  information  Services 
(UKC1S)  gives  an  analysis  of  costs  (1972)  of  providing  SDI  from  the  CA  Condensates  magnetic  tapes.  llisingcr3\  cited 
above  gives  a  detailed  breakdown  of  ( 1 97 1 )  costs  which  include  a  considerable  element  for  procuring  original  material 
as  ‘information  which  is  not  accessible  is  of  no  practical  use  to  the  subscriber’.  Multi-data  base  centres  specialising  in  SDI 
have  been  reviewed  by  Zais41  who  suggests  that  costs  tend  to  be  distributed  as  follows:  computer  processing  costs  take 
approximately  30%  of  the  budget,  personnel  costs  about  60%  and  data  base  acquisition  by  purchase  or  lease  about  10%. 
Vickers43  ha' collected  cost  data  from  18  operational  computer-based  systems  in  Europe  and  the  USA  using  a  structured 
cost  analysis  scheme.  He  tabulates  SD!  costs  per  user  and  unit  costs  of  SDI  processing.  The  figures  show  considerable 
variation. 


4.  BIBLIOGRAPHIES 


The  stimulus  for  preparing  formal  bibliographies  can  arise  in  various  ways.  An  Information  Centre  may  be  asked 
to  provide  a  bibliography  before  a  research  project  is  undertaken,  to  establish  wlnt  work  has  already  been  carried  out  on 
the  subject.  The  Centre  may  receive  requests  for  literature  searehes  on  similar  subjects  from  different  customers  which 
indicate  a  widespread  interest  in  a  new  development.  The  information  specialists  in  the  Centre  may  consider  that  a 
development  is  of  such  importance  that  they  should  anticipate  future  demands  lor  information  and  prepare  a  biblio¬ 
graphy  on  the  subject. 


Before  beginning  the  work  it  must  be  established  what  period  is  to  be  covered,  and  what  type  ol  publication  is  to 
be  cited,  such  as  openly  published  literature,  unpublished  reports,  or  both.  Th„  literature  and  reports  collection  of  the 
Centre  is  the  first  source  of  the  references  to  be  included.  If  the  subject  is  already  included  in  the  Centre's  computerised 
SDI  service,  items  can  be  cut  from  the  printouts  to  provide  the  nucleus.  Otherwise  suitable  abstracts  from  the  Centre's 
announcement  bulletin  arc  copied.  It  is  seldom  possible,  however,  to  complete  a  bibliography  using  in-house  lesouiccs. 
and  other  available  sources  must  be  considered.  Once  some  key  papers  are  identified,  the  lists  of  references  in  them  can 
be  consulted  and  each  citation  used  as  a  source  for  further  retrospective  search.  The  published  Science  Citation  Index 
(Institute  for  Scientific  Information,  325  Chesnut  Street.  Philadelphia,  Pa,  19106,  USA)  in  which  tournal  articles 
citing  a  given  reference  are  indexed,  is  helpful  for  this  process.  Centres  with  on-line  access  to  systems  such  as  1  ockhced 
DIALOG,  can  consult  the  1S1  data  base  SC1SEARCI1® .  An  interactive  on-line  search  to  other  data  bases  will  produce 
additional  references,  and  an  output  format  which  includes  abstracts  will  give  complete  bibliographical  entries  without 
necessitating  reference  to  the  original  source.  For  openly  available  report  literature  the  data  base  provided  by  the  US 
National  Technical  Information  Service  (NTIS)  based  on  their  Government  Reports  Announcements  (GK  V )  is  a  good 
source,  and  NTIS  information  specialists  themselves  prep  >re  bibliographies  based  both  on  the  NTIS  ami  on  the 
Engineering  Index  data  bases  under  the  general  title  NTlSearch,  fhese  bibliographies  are  advertised  in  UR  V 


tk> 

For  aerospace  report  literature  the  NASA  STAR/1  A  A  ilnta  base  is  of  particular  importance,  end  this  is  available 
on-line  via  the  NASA  RFCON  (REmotc  CONsole)  system  ir.  the  USA  end  in  Europe  on  the  European  Space  Agency’s 
Information  Retrieval  Service’s  RECON  system.  NASA  issues  a  number  of  continuing  bibliographies  drawn  from  this  data 
base  which  illustrate  its  value  as  a  source  for  bibliographies.  These  include: 

Aeronautical  Engineering  (NASA  SP-7037)  monthly 
Aerospace  Medicine  and  Biology  (NASA  SP-701 1)  monthly 
Earth  Resources  (NASA  SP-7Q4 1 )  quarterly 
Energy  (NASA  SP-7043)  quarterly 
Management  (NASA  SP-750G)  annually. 

They  are  available  for  purchase  from  NTIS. 

The  format  of  the  bibliography  must  be  carefully  considered.  The  citations  will  normally  be  in  the  same  format  as 
those  in  the  Centre’s  bulletin.  Citations  taken  front  other  sources  are  best  reformatted,  and  the  whole  collection  retyped 
and  reproduced  by  offset  lithography.  An  extensive  bibliography  can  usefully  be  grouped  into  subject  sections  and 
contain  a  separate  author  index.  A  shorter  bibliography  on  a  very  specific  subject  can  be  arranged  in  author  order.  The 
title  of  the  bibliography  should  indicate  the  period  covered  and,  if  limited  to  a  particular  type  of  pub’.. cation,  should 

indicate  litis  in  the  title,  i.e  'Bibliography  of  report  literature - ‘  or  ‘Bibliography  of  published  information . 

Bibliographies  should  conform  to  the  house  style  of  the  Information  Centre  and  used  in  a  special  series  with  a  distinctive 
cover. 


5.  STATE  OF  THE  ART  REPORTS  AND  REVIEWS 

State  of  the  art  reports  and  reviews  summarise  recent  progress  in  a  specialised  field  but,  unlike  bibliographies,  involve 
evaluation  of  the  references  cried  Their  compilation  begins  with  an  initial  survey  of  the  relevant  information  in  the  same 
way  as  in  comprehensive  bibliogn  .'hies,  but  the  original  papers  must  be  obtained  and  read.  Some  expertise  or  at  least 
background  knowledge  in  the  subject  reviewed  is  an  advantage  in  the  compiler,  and  if  this  is  not  available  in  the  Centre, 
a  contract  can  be  placed  with  a  suitable  expert,  and  the  literature  resources  of  the  Centre  placed  at  his  disposal.  Successful 
reviews  can,  however,  be  compiled  by  non-specialists  and  Geary*3  who  lias  compiled  a  number  of  surveys  issued  by  the 
British  Scientific  Instrument  Research  Association  describes  a  method  for  use  by  scientists  who  are  not  specialists,  relying 
mainly  on  abstracting  journals  for  the  primary  search,  leading  to  consultation  of  original  papers,  and  discussions  with 
specialists  who  are  accessible.  He  points  out  that  a  non-specialist  will  acquire  specialist  knowledge  during  the  searcli  and 
can  make  a  better  job  of  writing  for  other  non-specialists  who  will  form  the  majority  of  his  readers.  He  recommends  that 
a  review  should  lie  based  on  not  less  than  150  refe'ences,  but  should  not  exceed  400.  Between  six  and  twelve  months 
should  be  allowed  for  its  completion.  The  format  of  the  review  will  consist  of  the  text  followed  by  a  list  of  the  cited 
references  which  preferably  v  ill  include  abstracts.  Format  of  the  review  should  conform  to  the  house  style  of  the  Centre 
for  technical  reports. 


6.  PACKAGED  TECHNOLOGICAL  INFORMATION 


Information  packaged  in  particular  ways  is  an  important  method  of  technological  transfet.  especially  in  assisting 
small  firms  who  do  not  have  information  services  of  their  own.  In  such  cases  information  must  be  presented  in  a  form 
tailored  to  the  user’s  needs.  A  short  abstract  is  seldom  sufficient,  and  engineering  assessments  and  diagrams  can  he 
included  so  that  possible  applications  to  the  user’s  own  production  processes  are  made  clear.  The  source  of  the  informa¬ 
tion  may  be  one  or  more  technical  reports,  journal  articles,  or  ‘spin-off  from  a  defence  or  other  government-sponsored 
programme  which  is  considered  by  the  sponsors  to  be  suitable  for  unlimited  dissemination  outside  the  context  of  the 
original  programme. 


A  good  example  of  this  type  of  serv  ice  is  the  NASA  Tech  Brief.  \  collection  of  Tech  Briefs  is  published  quarterly, 
each  issue  containing  a  section  on  new  product  ideas,  and  individual  Tech  Briefs  classified  under  the  headings: 
electronic  components  and  circuits,  electronic  systems,  physical  sciences,  materials,  life  sciences,  mechanics,  machinery 
fabrication  technology,  mathematic!  and  information  sciences.  (Information  from  Director,  NASA  Technology 
Utilisation  Office.  PO  Box  875.  Baltimore; 'Washington  international  Airport.  Md  21340.)  Ind.vidiuil  Tech  Briefs  are  also 
contributed  to  the  NTIS  Tech  Notes  Service.  A  folder  of  single  page  Notes,  each  illustrating  the  essentials  of  an  item  ot 
applied  technology,  is  issued  every  two  weeks.  Other  contributors  include  the  National  Bureau  of  Standards,  the  Bureau 
of  Mines,  and  the  US  Army,  Navy  and  Air  Force.  Tech  Notes  are  isssued  m  the  subject  categories  Computers,  Electro¬ 
technology,  Energy,  Engineering,  Life  Sciences,  Machinery,  Manufacturing  Materials.  Ordnance.  Physical  Sciences,  testing 
and  instrumentation  In  the  United  Kingdom  a  rather  similar  service  for  UK  industry  was  provided  by  the  Department 
of  Industry’s  Techlink  (Schuler4*).  Each  Techlink  consisted  of  a  one  page  digest,  illustrated  where  necessary,  dealing  with 
a  single  topic  or  new  technological  development,  with  information  on  tiie  source  of  follow-up  information  it  was 
classified  under  one  or  more  of  40  subject  categories  which  were  used  to  effect  selective  distribution.  The  service  in  the 
form  described  was  discontinued  in  l')?S  and  replaced  by  a  TechAlert  service  in  which  Techlink  type  material  is 
submitted  for  publication  in  technical  journals  with  a  wide  circulation.  The  Technical  information  Serv  ice  of  the 
National  Research  Council  of  Canada  prepares  Tech  Briefs  by  analysing  and  .electing  published  material  on  technological 
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innovation  for  Canadian  manufacturing  industry  Eacn  Tech  Brief  is  a  Jigest  of  published  literature  of  about  five  pages 
on  a  particular  topic  and  has  been  described  by  Kirouac45. 

The  preparation  of  such  technological  information  packages  requires  considerable  expertise,  and  knowledge  both  of 
the  subject  and  of  the  customer’s  requirements.  An  engineering  background  and  preferably  experience  in  industry  are 
important  requirements  for  staff  engaged  in  such  work. 


7.  NEWSLETTERS 

A  Centre  serving  a  large  number  of  users  covering  a  wide  area  should  issue  a  periodic  Newsletter,  say  at  quarterly 
intervals.  This  can  be  a  quite  informal  publication  distributed  to  all  users  giving  information  on  topics  including  new 
services  proposed,  staff  changes  in  the  Centre,  and  guidance  on  procedures  such  as  requesting  information  or  documents 
from  the  Centre.  The  main  purpose  of  the  Newsletter  is  to  keep  users  in  touch  and  to  encourage  them  to  make  use  of  the 
services  of  the  Centre  in  the  most  effective  way.  A  Newsletter  should  be  written  with  a  light  touch,  and  readability  is 
an  important  quality .  Projecting  the  image  of  the  Centre  as  a  friendly  helpful  place  and  not  un  impersonal  depe;  inent 
should  be  the  aim  at  all  times. 


8.  THE  FUTURE 

The  increasing  use  of  commercial  data  bases  in  on  line  systems  will  affect  many  processes  m  Information  Cen:res. 
Its  main  effect  will  be  on  the  services  which  involve  the  exploitation  of  published  literature  in  the  Centre,  because  no 
commercially  available  data  base  will  include  controlled  and  security  classified  reports  or  other  material  of  restricted 
distribution  for  which  the  Centre  is  responsible.  As  has  been  indicated,  the  commerical  data  bases  can  be  used  to 
augment  the  Centre’s  services,  and  provide  the  information  which  the  Centre’s  information  scientists  will  repackage. 

New  methods  for  improving  scientific  communication  will  also  affect  information  dissemmauon  by  the  Centre.  In  tins 
connection  the  work  of  the  Capital  Systems  Group,  Inc.  sponsored  by  the  US  National  Science  Foundation  on  innova¬ 
tions  in  information  transfer,  should  be  noted.  Creager46  reports  on  the  publication  of  a  loose-leaf  guidebook,  half 
dealing  with  innovations  and  the  remaindei  with  creation  of  by-products,  pnnt  on-paper  alternatives,  non-pnnt-on-paper 
and  mixed  media  innovations,  trends  and  prospects.  Mention  should  also  be  made  of  a  most  interesting  survey  of 
computer  assisted  writing  and  editing  systems  compiled  by  Berman4’.  This  indicates  the  possibilities  of  using  the  latest 
computer  technology  and  suggests  future  trends  in  automated  publishing  systems. 
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Appendix  1 

EUROPEAN  SPACE  AGENCY  -  ESA  STANDARD  TITLES 


233  Ablation  and  ablative  materials 

101  Adhesives 

208  Aging  of  materials 

214  Air  and  gas  jets 

248  Air  traffic  control  and  collision  avoidance 

221  Aircraft  flight  testing 

247  Aircraft  noise  and  sonic  boom 

102  Aircraft  production  and  costs 

1 83  Air  cushion  vehicles  and  Hovercraft 

1 04  Aluminium  applications 

105  Antenna  properties 

283  Aurorae  and  airglow 

107  Automatic  flight  control 

1 09  Bearings  and  gears 

244  Beryllium 

299  Biometry  and  bioinstrumentation 

1 1 1  Bonds,  joints  and  fasteners 

254  Boron 

1 1 3  Carbon  fibres 

116  Ceramics 

276  Cloud  physics  and  seeding 

117  Coatings  and  paints 

228  Communication  satellites 

237  Communication  systems  for  satellites, 
commands 

205  Component  and  circuit  reliability 

1 1 8  Composite  materials 

1 1 7  Computer  storage  devices  and  memories 
1 1 5  Computers,  on  board 

120  Computerized  design 

1 1 9  Corrosion 

2 1 2  Creep  analysis  and  rupture  strength 

121  Cryogenics 

1 22  Data  acqusition 

1 23  Dielectrics 

280  Earth  atmosphere 

262  Earth  resources:  Agriculture 

265  Earth  resources:  Data  processing  and  sensors 

263  Earth  resources:  Geology  and  geography 

1 26  Elastomers 

1 28  Electric  batteries  and  fuel  cells 

1 1 4  Electronic  display  devices 

223  Electric  propulsion 

1 24  Electronic  packaging 

282  Electrostatic  probes 


207  Environment  simulation 

129  Fibre  technology 

127  Fluidics 

261  Flutter 

257  Friction  and  wear  in  metals 

274  Geodetic  satellites  and  surveys 

130  Graphite 

268  Gravity  gradient  satellites 

25 1  Holography 

132  Hydraulic  and  pneumatic  systems 

131  Hydrology  and  hydrography 

234  Inertial  guidance  an  J  navigation 

133  Information  science 

134  Infrared  technology 

246  Laminate,  honeycomb  and  sandwich 
structures 

241  Laser  applications 

135  Light  sensing  tubes 

213  Linear  and  non-linear  programming 

225  Loads  due  to  gusts/turbulence 

136  Lubricants  and  lubrication 

293  Lunar  composition  and  geology 

286  Magnetometers  and  magnetic  probes 

139  Management,  business  and  administration 

215  Management,  projects 

140  Materials  testing 

138  Metal  forming 

295  Meteorites  and  meteoroids 

145  Microelectronics,  design  and  fabrication 

149  Non-destructive  testing 

148  Nuclear  propulsion 

1 5 1  Oceanography 

152  Operations  research 

219  Optical  telescopes 

300  Orbit  calculations 

202  Outgassing/dcgassing 

275  Ozone 

1 54  Photochemistry 

272  Piezoelectricity 

155  Pipe  flow 

157  Plastics 

255  Plate  buckling 
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158 

Pollution,  air 

185 

Thermal  protection 

142 

Pollution,  noise 

186 

Thermosetting  resins 

141 

Pollution,  water 

238 

Thin  film  properties 

159 

Polymers  and  polymerization 

204 

Time  measurement 

161 

Powder  metallurgy 

209 

Titanium 

164 

Pressure  measurements 

232 

Trajectory  optimization 

167 

Pulse  code  modulation 

190 

Transducers 

178 

Transistors 

166 

Quality  control 

479 

Transistor  circuits 

169 

Real  time  operation 

181 

Transmission  lines 

256 

Reflective  coatings 

191 

Transport  economics  and  management 

229 

Rocket  propellants 

177 

Tropopause/Troposphere 

192 

Tube  heat  transfer 

230 

Satellite  attitude  control 

272 

Seals 

193 

Vacuum  technology 

269 

Self-adaptive  control  systems 

194 

Ventilation  and  cooling 

177 

Semiconductor  diodes  and  rectifiers 

216 

V/STOL  aircraft 

176 

Semiconductor  materials 

196 

Water  purification 

266 

Solar  cells 

146 

Waveguides,  resonators  and  microwave  tubes 

227 

Spacecraft  power  supplies 

27S 

Weather  forecasting 

180 

Stainless  steels 

198 

Welding 

182 

Superconductors 

258 

Temperature  measuring  instruments 

184 

Thermal  insulation 
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